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ABSTRACT 


The present disclosure relates to methods of preparing 
cell-based products for consumption that comprise proteins 
from exotic, endangered, and extinct species, as well as 
cell-based products for consumption. 
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GENERATION OF CELL-BASED PRODUCTS 
FOR CONSUMPTION THAT COMPRISE 
PROTEINS FROM EXOTIC, ENDANGERED, 
AND EXTINCT SPECIES 


PRIORITY 


[0001] This application claims priority to U.S. Provisional 
Application No. 63/171,605, filed Apr. 7, 2021, the entire 
content of which is incorporated herein by reference. 


SEQUENCE LISTING 


[0002] The instant application contains a Sequence Listing 
which has been submitted electronically in ASCII format 
and is hereby incorporated by reference in its entirety. Said 
ASCII copy, created on Jun. 10, 2022, is named UF_023_ 
01_WO_SL.txt and is 146,956 bytes in size. Said ASCII 
copy includes in a single file ASCII copies, created on Apr. 
4, 2022, are named (1) ON076011 amino acid sequence 
(SEQ ID NO:3) (2 KB in size), (2) ON076011 nucleotide 
sequence (SEQ ID NO:2) (5 KB in size), (3) ON076012 
amino acid sequence (SEQ ID NO: 1) (2 KB in size), (4) 
ON076012 nucleotide sequence (SEQ ID NO: 4) (4 KB in 
size), (5) ON076013 amino acid sequence (SEQ ID NO: 5) 
(2 KB in size), and (6) ON076013 nucleotide sequence 
(SEQ ID NO: 6)(6 KB in size). 


FIELD 


[0003] This invention is in the field of cell-based products 
for consumption, in particular, products that comprise pro- 
teins derived from exotic, endangered, and/or extinct spe- 
cies. The present disclosure relates to novel consumable 
products and methods of preparing such consumable prod- 
ucts. 


BACKGROUND 


[0004] As the world’s population continues to grow, the 
need for products for consumption is greater than ever. 
Given the expanding population, the existing market of 
conventional products is struggling to meet the demand. In 
vitro produced cell-based products for consumption have 
emerged as an attractive option to supplement the demand 
for conventional products. Moreover, in vitro produced 
cell-based products for consumption help alleviate several 
drawbacks linked to conventional products and help address 
controversial issues associated with, e.g., conventional meat 
production. Some known factors associated with conven- 
tional meat production include low conversion of caloric 
inputs to edible nutrients, microbial contamination of prod- 
uct, emergence and propagation of veterinary and zoonotic 
diseases, relative natural resource requirements, and indus- 
trial pollutants, such as greenhouse gas emissions and nitro- 
gen waste streams. 

[0005] Thus, it is an object of the invention to provide 
methods of preparing in vitro produced cell-based products 
for consumption. In particular, such cell-based products may 
comprise peptides and proteins, in addition to lipophilic 
molecules such as fatty acids, lipids, steroids and their 
metabolic precursors from exotic, endangered, and extinct 
species. To date, cell-based products that comprise such 
molecules have never been prepared, thus demonstrating the 
novelty of the invention. Cell-based consumption products 
comprising proteins from exotic, endangered, and/or extinct 
species may elicit a number of benefits. These may include, 
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for example, generation of unique nutrition profiles and 
organoleptic profiles never before observed (i.e., from 
extinct species), to provide options displacing the unsus- 
tainable harvest or poaching (i.e., of wild or threatened 
species), and for generating marketing appeal for new prod- 
uct alternatives that can potentially displace consumption of 
conventional animal products manufactured by means 
requiring animal husbandry and slaughter. 


SUMMARY 


[0006] This invention generally relates to methods of 
preparing in vitro produced cell-based products for con- 
sumption comprising proteins from exotic, endangered, and 
extinct species. By way of example, the cell-based products 
may be meat products. 

[0007] Ina first embodiment, DNA constructs encoding 
for proteins from extinct, exotic, or endangered species may 
be cloned and expressed in agriculturally or nutritionally- 
relevant host animal cells. Alternatively, the host cell may 
come from yeast or bacteria. These transfected cells may 
then form the basis for generation of cell-based products for 
consumption. In a preferred embodiment, the DNA con- 
structs derived from the extinct, exotic, or endangered 
species may be physically integrated into the host animal 
cell genome via various genomic engineering mechanisms 
prior to expression. Alternatively, protein from extinct, 
exotic, or endangered species may be expressed in other 
hosts, such as eukaryotic, microbial, or yeast systems, and 
subsequently added into cell-based products for consump- 
tion. Such protein may, in separate embodiments, be 
expressed in vivo. 

[0008] Inasecond embodiment, cells from distinct species 
related by a distant, common ancestor (e.g., an extinct, 
exotic, or endangered ancestral species) may be fused 
together to form syncytial heterokaryons. Formation of these 
syncytial heterokaryons may induce expression of genes 
from the common ancestral species that were silenced over 
the course of evolution. 


DESCRIPTION OF DRAWINGS 


[0009] This patent or application file contains at least one 
drawing prepared in color. Copies of this patent or patent 
application publication with color drawing(s) will be pro- 
vided by the Office upon request and payment of the 
necessary fee. 

[0010] FIG. 1 depicts a flow chart exhibiting how food 
products comprising proteins from extinct, exotic, or endan- 
gered species are prepared when DNA constructs encoding 
genes from such species are transfected into agriculturally or 
nutritionally-relevant host animal, yeast, or bacterial cells. 
[0011] FIG. 2 illustrates the trans-dominant modulators 
involved (avian trans-dominant activators and repressors; 
crocodilian trans-dominant activators and repressors) upon 
formation of heterokaryons from crocodilian and avian 
extant species. 

[0012] FIG. 3 provides a high-level schematic of how 
gene expression proceeds upon formation of heterokaryons 
from crocodilian and avian extant species. 

[0013] FIGS. 4B-1, 4B-2, 4B-3, and 4B-4 depict Tyran- 
nosaurus Rex (POC2W2.2) and Brachylophosaurus 
canadensis (P86289.1) collagen peptide sequences mapped 
by alignment against the protein sequence for chicken 
collagen 1A1 (COL1A1, NC_052558.1) to form a consen- 
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sus Dinosaur COL1A1 (ON076011) protein sequence. 
Shown are COL1A1 amino acid substitutions unique to 
Tyrannosaurus Rex and Brachylophosaurus canadensis. In 
particular, substitutions P-406-A and S-407-P from Tyran- 
nosaurus Rex and P-776-S and A-779-P from Brachylopho- 
saurus canadensis have been identified. FIGS. 4B-1-4B-4 
disclose SEQ ID NOS 7, 3, and 8-10, respectively, in order 
of appearance. 


[0014] FIGS. 4A-1, 4A-2, 4A-3, and 4A-4 depict Tyran- 
nosaurus Rex (POC2W4.1) and  Brachylophosaurus 
canadensis (P86290.1) collagen peptide sequences mapped 
by alignment against the peptide sequence for chicken 
collagen 1A2 (COL1A2, NC_052533.1) to form a consen- 
sus Dinosaur COL1A2 (ON076012) protein sequence. 
Shown are COLIA2 amino acid substitutions unique to 
Tyrannosaurus Rex and Brachylophosaurus canadensis. In 
particular, substitutions l-605-P from Tyrannosaurus Rex 
and A-311-T from Brachylophosaurus canadensis have 
been identified. FIGS. 4A-1-4A-4 disclose SEQ ID NOS 15, 
1, and Il-13, respectively, in order of appearance. 


[0015] FIGS. 5A-5B depict cloning of dinosaur collagen 
1A1 (A) and 142 (B) into PhiC31 vectors. Dinosaur-specific 
COLIAI (P-406-A, S-407-P, P-776-S and A-779-P) amino 
acid substitutions were inserted into the chicken COL1A1 
reference protein sequence, chicken codon optimized, and 
designated Dinosaur COL1A1 (ON076011). Dinosaur-spe- 
cific COL1A2 (l-605-P, A-311-T) amino acid substitutions 
were inserted into the chicken COL1A2 reference protein 
sequence, chicken codon optimized, and designated Dino- 
saur COL1A2 (ON076012). In FIG. SA, the Dinosaur 
COLIAI gene sequence was chicken codon optimized and 
cloned into a PhiC31 vector under the control of a EFla 
promoter. In FIG. 5B, the Dinosaur COL1A2 gene sequence 
was codon optimized and cloned into a PhiC31 vector under 
control of a PGK promoter. The Dinosaur COL1A1 and 1A2 
vectors were used to develop stable expression in cultured 
chicken cells for cultured meat. 


[0016] FIG. 6 depicts cultured chicken cells transfected 
with Dinosaur collagen 1A1 and 1A2 gene vectors. Dino- 
saur collagen 1A1 (ON076011) and 1A2 (ON076012) gene 
vectors or empty vectors were transfected with integrase into 
cultured chicken cells. A naive sample served as a control. 
Cells with stable fluorescent reporter expression were 
selected by cell sorting and seeded into a 12-well plate and 
imaged at 10x magnification under bright field, Texas Red 
(615 nm; RFP) and Green Fluorescence (509 nm; GFP). 


[0017] FIGS. 7A-7B depict Dinosaur collagen COL1A1 
(A) and COL1A2 (B) transcript expression in cultured 
chicken cells. The relative expression of exogenous Dino- 
saur COL1A1 (7A) and COL1A2 (7B) was confirmed by 
quantitative PCR in cultured chicken cells transfected and 
selected for stable expression of both Dinosaur genes. The 
relative expression levels of the codon optimized Dinosaur 
collagen 1A1 and 1A2 genes are based on delta Ct 
(ACt=AChpisosaur cor a127 ACt GAPDH) calculation with 
endogenous GAPDH mRNA as the reference transcript. The 
expression of Dinosaur COL1A1 and COL1A2 relative to 
the expression of endogenous GAPDH is presented as 
2747. GAPDH has been assigned an expression level of 1. 
Relative expression levels of COL1A1 (A) and COL1A2 (B) 
above 1 indicate genes are present at higher levels than 
GAPDH, while relative expression levels below 1 indicate 
gene levels are less than GAPDH. 
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[0018] FIG. 8 depicts collagen staining in cultured chicken 
cells expressing Dinosaur collagen 1A1 and 1A2 genes. 
Cultured chicken cells were transfected with Dinosaur col- 
lagen 1A1 and 1A2 gene vectors accompanied by red 
fluorescent protein and green fluorescent protein reporters, 
respectively. Cells were co-transfected with integrase and 
selected for stable gene expression. 96-well plates were 
stained with collagen I primary antibody followed by an 
Alexa Fluor 350 secondary antibody. 

[0019] FIGS. 9A, 9B, 9C, and 9D depict identification of 
Mammoth-specific protein coding variations in the myosin 
heavy chain gene MYH13. Mammoth and elephant genomes 
were analyzed for protein coding non-synonymous single 
nucleotide polymorphisms (SNPs) that only exist in Mam- 
moth myosin genes. Elephant genomes examined include 
African elephant (L africana B, L africana C, LoxAfr 
3.0), Asian elephant (E maximus D, E maximus E), For- 
est Elephant (L cyclotis A, L cyclotis F), Straight tusked 
elephant (P antiquus N, P antiquus O). Mammoth 
genomes included in the analysis were Woolly Mammoth 
(Wrangle, Oimyakon, M primigenius G, M_primigenius_ 
H, M primigenius S, M primigenius V) and Columbian 
Mammoth (M columbi U). Mammoth-specific protein cod- 
ing non-synonymous SNPs were identified in myosin heavy 
chain gene MYH13 to include T-1306-S, N-1668-D, 
A-1914-V. These amino acid substitutions did not exist in 
any elephant genomes analyzed. The identified amino acid 
substitutions were applied to the Loxodonta africana 
MYH13 sequence (XP 003416837.1) to create Mammoth 
MYH13 (ON076013). FIGS. 9A-9D disclose SEQ ID NOS 
14, 5 and 16, respectively, in order of appearance. 

[0020] FIGS. 10A-10B depict a codon optimized Mam- 
moth MYH13 gene. The mammoth myosin heavy chain 
gene (MYH13) containing amino acids 1914-V, 1668-D, and 
1306-S was created by mapping the amino acid substitutions 
to the Loxodonta africana reference genome. The nucleotide 
sequence was chicken codon optimized for expression in 
cultured chicken cells. FIGS. 10A-10B disclose SEQ ID 
NO:6. 

[0021] FIG.11 depicts a mammoth MYH13 gene inserted 
into a PhiC31 vector. The mammoth-specific MYH13 con- 
taining 1914-V, 1668-D, and 1306-S amino acids was 
encoded in the Loxodonta africana MYH13 reference 
sequence. The nucleotide sequence was chicken codon opti- 
mized and cloned into a PhiC31-RFP vector for stable 
expression in cultured chicken cells developed for cultured 
meat. 

[0022] FIG. 12 depicts cultured chicken cells transfected 
with a Mammoth MYH13 gene vector (ON076013). Mam- 
moth MYH13 was expressed along with a red fluorescent 
reporter (RFP) in cultured chicken cells using the PhiC31 
vector system. Mammoth MYH13 gene vector or empty 
vector was transfected with integrase into cultured chicken 
cells. A naive sample served as a control. Cells with stable 
expression were selected and seeded into a 12-well plate and 
imaged at 10x magnification under bright field, Texas Red 
(615 nm; RFP) and Green Fluorescence (509 nm; GFP). The 
naive sample (non-transfected control; top panel) showed no 
fluorescence when imaged in the RFP channel. An RFP 
empty vector (PhiC3 l-RFP) is depicted in the middle panel. 
[0023] FIG. 13 depicts Mammoth myosin MYH13 tran- 
script expression in cultured chicken cells. The relative 
expression of exogenous Mammoth MYH13 was confirmed 
bv quantitative PCR in cultured chicken cells transfected 
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and selected for stable expression. The relative expression 
level of codon optimized Mammoth MYH13 mRNA is 
based on delta Ct (ACt=ACtrammon my 3 ACt GAPDH) 
calculation with endogenous GAPDH mRNA as the refer- 
ence transcript. The expression of Mammoth MYH]13 rela- 
tive to the expression of endogenous GAPDH is presented as 
p-ACT 

(0024) FIG. 14 depicts Mammoth MYH13 staining in 
cultured chicken cells expressing the Mammoth MYH13 
gene. Cultured chicken cells were transfected with Mam- 
moth MYH13 gene vector accompanied by a red fluorescent 
protein reporter. Cells were co-transfected with integrase 
and selected for stable gene expression. 96-well plates were 
stained with MYH13 primary antibody followed by an 
Alexa Fluor 488 secondary antibody. Cell nuclei were 
stained with DAPI. GAPDH has been assigned an expres- 
sion level of 1. Relative expression levels of MYH13 above 
1 indicate genes are present at higher levels than GAPDH, 
while relative expression levels below 1 indicate gene levels 
are less than GAPDH. 


DETAILED DESCRIPTION OF THE 
INVENTION 


[0025] Provided herein are methods and compositions 
related to the in vitro production of cell-based products for 
consumption comprising proteins from exotic, endangered, 
and extinct species. For further detail regarding in vitro 
production of cell-based products for consumption, please 
reference U.S. Publication No. 2021/0235733 A1, the entire 
content of which is incorporated herein. 

[0026] Before describing particular embodiments in 
detail, it is to be understood that the disclosure is not limited 
to the particular embodiments described herein, which can 
vary. It is also to be understood that the terminology used 
herein is for the purpose of describing particular illustrative 
embodiments only and is not intended to be limiting unless 
otherwise defined. The terms used in this specification 
generally have their ordinary meaning in the art, within the 
context of this disclosure and in the specific context where 
each term is used. Certain terms are discussed below or 
elsewhere in the specification, to provide additional guid- 
ance to the practitioner in describing the compositions and 
methods of the invention and how to make and use them. 
The scope and meaning of any use of a term will be apparent 
from the specific context in which the term is used. As such, 
the definitions set forth herein are intended to provide 
illustrative guidance in ascertaining particular embodiments 
of the invention, without limitation to particular composi- 
tions or biological systems. 

[0027] As used in the present disclosure and the appended 
claims, the singular forms “a,” “an” and “the” include plural 
references unless the content clearly dictates otherwise. 
[0028] Throughout the present disclosure and the 
appended claims, unless the context requires otherwise, the 
word “comprise”, or variations such as “comprises” or 
“comprising”, will be understood to imply the inclusion of 
a stated element or group of elements but not the exclusion 
of any other element or group of elements. 

[0029] Unless specific definitions are provided, the 
nomenclature utilized in connection with, and the laboratory 
procedures and techniques of, molecular biology, cell biol- 
ogy, analytical chemistry, and synthetic organic chemistry 
described herein are those well-known and commonly used 
in the art. Standard techniques may be used for recombinant 
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technology, molecular, biological, microbiological, chemi- 
cal syntheses, and chemical analyses. 


Generation of Cell-Based Products for Consumption 


[0030] Provided herein are methods to produce in vitro 
cell-based products for consumption. 


Cells 


[0031] The cell-based products for consumption of the 
disclosure are compositions produced by the in vitro cultur- 
ing of naturally occurring, transgenic, or modified animal 
cells in culture. In alternative embodiments, cell-based prod- 
ucts for consumption may be prepared by culture in various 
recombinant protein expression systems, such as microbial 
or yeast systems. 

[0032] The cells used in the methods of the present 
disclosure can be primary cells, or modified cell lines, such 
as adapted or immortalized cell lines. The methods provided 
herein are applicable to any metazoan cell in culture. Gen- 
erally, the cells are from any metazoan species whose tissues 
are suitable for dietary consumption and demonstrate the 
capacity for skeletal muscle tissue specification. 

[0033] In some embodiments, the cells are derived from 
any non-human animal species intended for human or non- 
human dietary consumption, for example, cells of avian, 
ovine, caprine, porcine, bovine, and piscine origin, or, in 
other words, cells of livestock, poultry, avian, game, or 
aquatic species. 

[0034] In some embodiments, the cells are from livestock, 
such as domestic cattle, pigs, sheep, goats, camels, water 
buffalo, rabbits and the like. In some embodiments, the cells 
are from poultry, such as domestic chicken, turkeys, ducks, 
geese, pigeons and the like. In some embodiments, the cells 
are from game species, such as wild deer, gallinaceous fowl, 
waterfowl, hare and the like. In some embodiments, the cells 
are from aquatic species or semi-aquatic species harvested 
commercially from wild fisheries or aquaculture operations, 
or for sport, including certain fish, crustaceans, mollusks, 
cephalopods, cetaceans, crocodilians, turtles, frogs and the 
like. 

[0035] In some embodiments, the cells are from exotic, 
conserved or endangered animal species. In some embodi- 
ments, the cells are from Gallus gallus, Gallus domesticus, 
Bos taurus, Sous scrofa, Meleagris gallopavo, Anas 
platyrynchos, Salmo salar, Thunnus thynnus, Ovis aries, 
Coturnix, Capra aegagrus hircus, or Homarus americanus. 
[0036] In some embodiments, the cells are primary stem 
cells, self-renewing stem cells, embryonic stem cells, 
pluripotent stem cells, induced pluripotent stem cells, or 
trans-differentiated pluripotent stem cells. 

[0037] In some embodiments, the cells are modifiable by 
a genetic switch to induce rapid and efficient conversion of 
the cells to skeletal muscle for cultured production. 

[0038] Insome embodiments, the cells are myogenic cells 
destined to become muscle or muscle-like cells. In some 
embodiments, the myogenic cells are natively myogenic, 
e.g., myoblasts. Natively myogenic cells include, but are not 
limited to, myoblasts, myocytes, satellite cells, side popu- 
lation cells, muscle derived stem cells, mesenchymal stem 
cells, myogenic pericytes, or mesangioblasts. 

[0039] In some embodiments, cells are of skeletal muscle 
lineage. Cells of skeletal muscle lineage include myoblasts, 
myocytes, and skeletal muscle progenitor cells, also called 
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myogenic progenitors that include satellite cells, side popu- 
lation cells, muscle derived stem cells, mesenchvmal stem 
cells, mvogenic pericvtes, and mesangioblasts. 


[0040] In some embodiments, the cells are non-myogenic, 
and such non-mvogenic cells can be programmed to be 
mvogenic, for example the cells mav comprise fibroblasts 
modified to express one or more mvogenic transcription 
factors. In exemplarv embodiments, the myogenic transcrip- 
tion factors include MYOD1, MYOG, MYF5, MYF6, 
PAX3, PAX7, paralogs, orthologs, and genetic variants 
thereof. In some embodiments, the cells are modified to 
express one or more mvogenic transcription factors as 
described in a PCT Publication No. WO/2015/066377, 
incorporated by reference herein in its entirety. 


[0041] In some embodiments, the cells comprise a mixture 
of cell populations described herein, e.g., a mixture of 
fibrogenic cells and myogenic cells in co-culture, e.g., a 
mixture of fibroblasts and myoblasts in co-culture. In some 
embodiments, the cells used for the in vitro production of 
cell-based products for consumption are a mixture of fibro- 
blasts and myoblasts in a suspension co-culture. In some 
embodiments the cells used for the in vitro production of 
cell-based products for consumption are a mixture of fibro- 
blasts and myoblasts in an adherent co-culture. In some 
embodiments, the co-culture can further comprise adipo- 
cytes. 

[0042] In some embodiments, the cells are in either a 
suspension culture or adherent co-culture, and comprise a 
mixture of fibroblasts and myoblasts, wherein the ratio of the 
fibroblasts to myoblasts (designated as F and M) ranges 
from about 5F:95M to about 95F:5M. In exemplary embodi- 
ments, the ratio of the fibroblasts to myoblasts is about 
5F:95M, 10F:90M, 15F:85M, 20F:80M, 25F:75M, 30F: 
70M, 35F:65M, 40F:60M, 45F:55M, 50F:50M, 55F:45M, 
60F:40M, 65F:35M, 70F:30M, 75F:25M, 80F:20M, 85F: 
15M, 90F:10M, or even about 95F:5M. 


[0043] In some embodiments, the cells are genetically 
modified to inhibit a pathway, e.g., the HIPPO signaling 
pathway. Exemplary methods to inhibit the HIPPO signaling 
pathway as described in PCT Publication No. WO/2018/ 
208628, incorporated by reference herein in its entirety. 


[0044] In some embodiments, the cells are modified to 
express telomerase reverse transcriptase (TERT) and/or 
inhibit cyclin-dependent kinase inhibitors (CKI). In some 
embodiments, the cells are modified to express TERT and/or 
inhibit cyclin-dependent kinase inhibitors as described in 
PCT Publication No. WO 2017/124100, incorporated by 
reference herein in its entirety. 


[0045] In some embodiments, the cells are modified to 
express glutamine synthetase (GS), insulin-like growth fac- 
tor (IGF), and/or albumin. Exemplary methods of modifying 
cells to express GS, IGF, and/or albumin are described in 
PCT Publication No. WO/2019/014652, which is incorpo- 
rated by reference herein in its entirety. 


[0046] In some embodiments, the cells may comprise any 
combination of the modifications described herein. 


Cultivation Infrastructure 


[0047] As referred to herein, a cultivation infrastructure 
refers to the environment in which the cells are cultured or 
cultivated to provide a two-dimensional or three-dimen- 
sional product for consumption. 
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[0048] A cultivation infrastructure may be a roller bottle, 
a tube, a cylinder, a flask, a petri-dish, a multi-well plate, a 
dish, a vat, an incubator, a bioreactor, an industrial fermenter 
and the like. 

[0049] While the cultivation infrastructure itself may have 
a three-dimensional structure or shape, the cells cultured in 
the cultivation infrastructure may form a monolayer of cells. 
Compositions and methods of the present disclosure can 
promote a three-dimensional growth of metazoan cells in the 
cultivation infrastructure to provide a scaffold-less self- 
assembly of a three-dimensional cellular biomass. 

[0050] A three-dimensional cultivation infrastructure may 
be sculpted into different sizes, shapes, and forms, as 
desired, to provide the shape and form for the muscle cells 
to grow and resemble different types of muscle tissues such 
as steak, tenderloin, shank, chicken breast, drumstick, lamb 
chops, fish fillet, lobster tail, etc. The three-dimensional 
cultivation infrastructure may be made from natural or 
synthetic biomaterials that are non-toxic so that they may 
not be harmful if ingested. Natural biomaterials may 
include, for example, collagen, fibronectin, laminin, or other 
extracellular matrices. Synthetic biomaterials may include, 
for example, hydroxyapatite, alginate, polyglycolic acid, 
polylactic acid, or their copolymers. The three-dimensional 
cultivation infrastructure may be formed as a solid or 
semisolid support. 

[0051] A cultivation infrastructure can be of any scale and 
support any volume of cellular biomass and culturing 
reagents. In some embodiments, the cultivation infrastruc- 
ture ranges from about 10 uL to about 100,000 L. In 
exemplary embodiments, the cultivation infrastructure is 
about 10 uL, about 100 uL, about 1 mL, about 10 mL, about 
100 mL, about 1 L, about 10 L, about 100 L, about 1000 L, 
about 10,000 L, or even about 100,000 L. 


[0052] In some embodiments, the cultivation infrastruc- 
ture comprises a substrate. A cultivation infrastructure may 
comprise a permeable substrate (e.g., permeable to physi- 
ological solutions) or an impermeable substrate (e.g., imper- 
meable to physiological solutions). The substrate can be flat, 
concave, or convex. The substrate may be textured so as to 
promote cell growth and cell sheet attachment. 

[0053] In some embodiments. the culturing of cells in the 
cultivation infrastructure can induce the production of extra- 
cellular matrix (ECM) that may act as an autologous scaffold 
to direct three-dimensional cellular growth, e.g., to direct 
attachment, proliferation and hypertrophy of cells on a plane 
perpendicular to the substrate. 

[0054] In some embodiments, the cultivation infrastruc- 
ture may not comprise an exogenously added scaffold to 
promote self-assembly of a three-dimensional cellular bio- 
mass. In some embodiments, the cultivation infrastructure 
may not comprise exogenous scaffolds such as a hydrogel or 
soft agar. 


Culturing Conditions 


[0055] The culturing conditions for the generation of 
cell-based products for consumption are generally aseptic, 
and sterile. 

[0056] Cells can be grown in an adherent culture format to 
form a cell sheet or can be grown in a suspension culture 
format to form a cell pellet. Table 1, shown below, provides 
exemplary culture methods for the various products that can 
be produced in vitro. 
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Base media 


DMEM-F12 with 
FBS (High), BS (High), 
CS (Low), HS (Low) 
DMEM-F12 with 
FBS (High), BS (High), 
CS (Low), HS (Low) 
DMEM-F12 with 
FBS (High), BS (High), 
CS (Low), HS (Low) 
DMEM-F12 with 
FBS (High), CS (Low) 
DMEM-F12 with 
CS (High), BS (Low) 
DMEM-F12 with 
CS (High), BS (High) 
DMEM-F12 with 
BS (High), CS (Low) 
MEM-F12 with 10% FBS 


DMEM-F12 with 
FBS (High), CS (Low) 
DMEM-F12 with 
BS (High), CS (Low) 
DMEM-F12 with 
BS (High), CS (Low) 
DMEM-F12 with 
BS (High), CS (Low 
DMEM-F12 with 
BS (High), CS (Low) 
DMEM-F12 with 
BS (High), CS (Low 


Chemically-defined media 


with BS (low) 


Chemically defined media 


formula. No serum 
SMEM-F12 with 


TABLE 1 
Cell culture methods used to generate in vitro produced cell-based meat 
Culture 
Condition 
Cell Type(s) Culture 
Method # Sample ID (ratio) format 
1 A. platyrhynchos (duck) Co-culture Adherent 
fibroblast/myoblast tissue 1 F/M (50/50) 
2 A. Platyrhynchos (duck) ^ Monoculture F Adherent 
fibroblast tissue 1 
3 Bos (Cow) Monoculture F Adherent 
fibroblast tissue 1 
4 Gallus (chicken) Monoculture F Adherent 
fibroblast tissue 1 
5 Gallus (chicken) Monoculture F Adherent 
fibroblast tissue 2 
6 Gallus (chicken) Monoculture F Adherent 
fibroblast tissue 3 
7 Gallus (chicken) Monoculture F Adherent 
fibroblast tissue 4 
8 Gallus (chicken) Monoculture F Adherent D 
fibroblast cells 1 
9 Gallus (chicken) Co-culture Adherent 
fibroblast/myoblast tissue 1 F/M (30/70) 
10 Gallus (chicken) Monoculture F Adherent 
fibroblast tissue 5 
11 Gallus (chicken) Monoculture M Suspension 
myoblast cells 1 
12 Gallus (chicken) Co-culture Adherent 
fibroblast/myoblast tissue 2 F/M (30/70) 
13 Gallus (chicken) Co-culture Adherent 
fibroblast/myoblast tissue 3 F/M (50/50) 
14 Gallus (chicken) Co-culture Adherent 
fibroblast/myoblast tissue 4 F/Monoclonal 
M (50/50) 
15 Gallus (chicken) Co-culture Adherent 
fibroblast/myoblast tissue 5 F/Monoclonal 
M (70/30) 
16 Gallus (chicken) Monoculture M Suspension 
myoblast cells 2 
17 Gallus (chicken) Monoculture M Suspension 
myoblast cells 3 


[0057] In some embodiments, the media is substantially 
free of serum or other components derived from an animal. 
[0058] Accordingly, an exemplary method of producing in 
vitro produced cell-based meat comprises: (a) providing 
fibroblasts and/or myoblasts from a non-human organism; 
(b) culturing the fibroblasts and/or myoblasts in media under 
conditions under which the fibroblasts and/or myoblasts 
grow in either suspension culture or adherent culture, 
wherein the media is substantially free of serum and other 
components derived from an animal. 

[0059] In some embodiments, the cells are grown in a 
suspension culture, e.g., in a shake flask, and the product of 
the culture is centrifuged, yielding a cell pellet. In other 
embodiments, the cells are grown in adherent culture, and 
the product of the culture is a cell sheet. 


Formulation 


[0060] The cell-based products for consumption of the 
disclosure may be processed into any variety of products 
including, but not limited to, cell-based processed food 
products, cell-based meat products, cell-based beverage 
products, additives, ingredients, nutritional supplements, 
vitamins, nutraceuticals, broths, flavorings, fractionations, 


BS (high), CS (low) 


concentrates, decoctions, isolates, extracts, tinctures, dyes, 


oils, soaps, detergents, lubricants, fragrances, cosmetics, 
medications, emollients, topical ointments, or any other 
suitable edible or consumable product. Exemplary cell- 
based products of the disclosure include cell-based meat 
products, such as, for example, avian meat products, chicken 
meat products, duck meat products, and bovine meat prod- 
ucts. 


Characteristics of Cell-Based Products for Consumption 


[0061] Provided herein are in vitro produced cell-based 
products for consumption comprising a number of unique 
features that allow them to be distinguished from conven- 
tional products, for example, conventional meat products, 
which can involve the slaughter or demise of live animals. 
The in vitro methods can also be tailored to achieve desired 
traits such as health and sensory benefits. 


Hormones 


[0062] As compared to conventional products, the in vitro 
produced cell-based products of the disclosure can comprise 
a significantly lower amount of steroid hormones. For 
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example, using the in vitro culturing methods described, 
there need not be any exogenous hormones added into the 
culture thus resulting in lower or non-existent hormonal 
levels in a resulting cell-based meat product. Accordingly, in 
some embodiments, the cell-based product is substantially 
free of steroid hormones (i.e., contains little or no steroid 
hormones). This is in contrast to products from animals 
raised for conventional meat production, which are often fed 
or otherwise administered exogenous hormones. 

[0063] Accordingly, in some embodiments, the cell-based 
product of the disclosure comprises no more than about 1 ug, 
0.5 ug, 0.1 ug, 0.05 ug, 0.01 ug, 0.005 ug, or even about 
0.001 ug steroid hormone/kg dry mass of cell-based product. 
In some embodiments, the cell-based product comprises no 
more than about 1 ug, 0.5 ug, 0.1 ug, 0.05 ug, 0.01 ug, 0.005 
ug, or even about 0.001 ug progesterone/kg dry mass of 
cell-based product. In some embodiments, the cell-based 
product comprises no more than about 1 ug, 0.5 ug, 0.1 ug, 
0.05 ug, 0.01 ug, 0.005 ug, or even about 0.001 ug testos- 
terone/kg dry mass of cell-based product. In some embodi- 
ments, the cell-based product comprises no more than about 
0.05 ug, 0.01 ug, 0.005 ug, or even about 0.001 ug estradiol/ 
kg dry mass of cell-based product. In exemplary embodi- 
ments, the cell-based product comprises no more than about 
35 ng estradiol/kg dry mass of cell-based product. 


Microbial Contamination 


[0064] Using the sterile, laboratory-based in vitro cultur- 
ing methods described, the cell-based product is substan- 
tially free of microbial contaminants. “Substantially free” 
means that the concentration of microbes or parasites is 
below a clinically significant level of contamination, i.e., 
below a level wherein ingestion could lead to disease or 
adverse health conditions. Such low levels of contamination 
allow for an increased shelf life. This is in contrast to 
products from animals raised for conventional meat produc- 
tion. As used herein, microbial contamination includes, but 
is not limited to, bacteria, fungi, viruses, prions, protozoa, 
and combinations thereof. Harmful microbes may include 
coliforms (fecal bacteria), E. coli, yeast, mold, Campylo- 
bacter, Salmonella, Listeria, and Staph. 

[0065] In addition, cells grown in culture may be substan- 
tially free from parasites such as tapeworms that infect cells 
of live animals and that are transferred to humans through 
consumption of insufficiently cooked meat. 


Antibiotics 


[0066] Relative to conventional products, in vitro pro- 
duced cell-based products of the disclosure comprise a 
significantly lower amount of antibiotics, or are substantially 
free of antibiotics, or are free of antibiotics entirely. For 
example, using the in vitro culturing methods described 
herein, the use of antibiotics in culture can be controlled or 
eliminated, thus resulting in lower or non-existent antibiotic 
levels in the resulting cell-based product. Accordingly, in 
some embodiments, the cell-based product is substantially 
free of antibiotics (i.e., contains little or no antibiotics). This 
is in contrast to animals raised for conventional meat pro- 
duction, which are often fed or otherwise administered 
exogenous antibiotics. 

[0067] Accordingly, in some embodiments, the cell-based 
product of the disclosure comprises no more than about 100 
ug antibiotics/kg dry mass of cell-based product, 90 ug 
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antibiotics/kg dry mass of cell-based product, 80 ug antibi- 
otics/kg dry mass of cell-based product, 70 ug antibiotics/kg 
dry mass of cell-based product, 60 ug antibiotics/kg dry 
mass of cell-based product, 50 ug antibiotics/kg dry mass of 
cell-based product, 40 ug antibiotics/kg dry mass of cell- 
based product, 30 ug antibiotics/kg dry mass of cell-based 
product, 20 ug antibiotics/kg dry mass of cell-based product, 
10 ug antibiotics/kg dry mass of cell-based product, 5 ug 
antibiotics/kg dry mass of cell-based product, 1 ug antibi- 
otics/kg dry mass of cell-based product, 0.5 ug antibiotics/kg 
dry mass of cell-based product, 0.1 ug antibiotics/kg dry 
mass of cell-based product, 0.05 ug antibiotics/kg dry mass 
of cell-based product, or even about 0.01 ug/kg of antibiot- 
ics/kg dry mass of cell-based product. 


Lipids 


[0068] As compared to conventional products, the in vitro 
produced cell-based products of the disclosure comprise a 
lower average total lipid (fat) content. For example, cell- 
based meat generally has an average total fat content 
between about 0.5% to about 5.0%, whereas the fatty acid 
content in conventional meat varies widely and can range 
from about 3% to about 18%, depending on the cut of meat. 


[0069] Accordingly, in some embodiments, the cell-based 
product of the disclosure comprises an average total fat 
content of about 0.5%, about 0.6%, about 0.7%, about 0.8%, 
about 0.9%, about 1.0%, about 1.1%, about 1.2%, about 
1.3%, about 1.4%, about 1.5%, about 1.6%, about 1.7%, 
about 1.8%, about 1.9%, about 2.0%, about 2.1%, about 
2.2%, about 2.3%, about 2.4%, about 2.5%, about 2.6%, 
about 2.7%, about 2.8%, about 2.9%, about 3.0%, about 
3.1%, about 3.2%, about 3.3%, about 3.4%, about 3.5%, 
about 3.6%, about 3.7%, about 3.8%, about 3.9%, about 
4.0%, about 4.1%, about 4.2%, about 4.3%, about 4.4%, 
about 4.5%, about 4.6%, about 4.7%, about 4.8%, about 
4.9%, or about 5.0%, when measured as a % of total wet 
mass of the cell-based product. A lower fat content provides 
a lower caloric content, as well as other related health 
benefits, when compared to conventional products. 


[0070] The methods provided herein can alter specific 
fatty acid profiles to achieve desired flavor characteristics or 
fatty acid profiles of the product. The lower levels of fatty 
acids in the cell-based product of the disclosure also pro- 
motes an increased shelf life, for example by leading to 
lower levels of fatty oxidation in the product. 


Amino Acids 


[0071] The cell-based meat product of the disclosure gen- 
erally comprises about 50 g to about 95 g by weight of amino 
acids per 100 g dry mass. 


Vitamin E Content 


[0072] As compared to conventional products, the in vitro 
produced cell-based products of the disclosure comprise a 
higher Vitamin E (aTocopherol) content. In some embodi- 
ments, the cell-based product of the disclosure comprises at 
least about 0.5 mg, at least about 0.6 mg, at least about 0.7 
mg, at least about 0.8 mg, at least about 0.9 mg, or at least 
about 1.0 mg/Vitamin E/100 g wet mass of cell-based 
product. 


US 2022/0333081 A1 


Moisture Content 


[0073] The cell-based product of the disclosure generally 
has a moisture content of about 65% to about 95%. 


Architecture of Cell-Based Meat 


[0074] By way of example, cell-based meat, unless oth- 
erwise manipulated to include, does not include vascular 
tissues, such as veins and arteries, whereas conventional 
meat does contain such vasculature, and contains the blood 
found in the vasculature. Accordingly, in some embodi- 
ments, the cell-based meat does not comprise any vascula- 
ture. 

[0075] Likewise, cell-based meat, although composed of 
muscle or muscle-like tissues, unless otherwise manipulated 
to include, does not comprise functioning muscle tissue. 
Accordingly, in some embodiments, the cell-based meat 
does not comprise functioning muscle tissue. 

[0076] It is noted that features such as vasculature and 
functional muscle tissue can be further engineered into the 
cell-based meat produced from this disclosure, should there 
be a desire to do so. 


Supplementation 


[0077] In other embodiments, other nutrients, such as 
vitamins, may be added to increase the nutritional value of 
the cell-based product. For example, this may be achieved 
through the exogenous addition of the nutrients to the 
growth medium or through genetic engineering techniques. 


Shelf Life 


[0078] A significant portion of meat and meat products are 
spoiled every year. It is estimated that approximately 3.5 
billion kg of poultry and meat are wasted at the consumer, 
retailer and foodservice levels which have a substantial 
economic and environmental impact (Kantor et al. (1997). 
A significant portion of this loss is due to microbial spoilage. 
[0079] Conventional meat is perishable and has a rela- 
tively short shelf life stability (interchangeably referred to as 
simply “shelf life” herein). The shelf life is the amount of 
time a food remains fit for human consumption. The com- 
position of conventional meat and the conditions used to 
slaughter and harvest the meat create favorable growth 
conditions for various microorganisms including fecal bac- 
teria (e.g., coliform bacteria). Meat is also very susceptible 
to spoilage due to chemical, oxidative and enzymatic activi- 
ties. It is generally regarded that microbial growth, oxidation 
and enzymatic autolysis are three mechanisms responsible 
for the spoilage of meat. The breakdown of fat, protein and 
carbohydrates of meat results in the development of off- 
odors and off-flavor and these the off-odors and off-flavors 
make the meat objectionable for human consumption. 
Depending on the species and method of harvest, conven- 
tional meat products are not safe to consume after a rela- 
tively short period of storage time. For example, chicken 
should be cooked within a few days of processing and/or 
purchasing. Cooked poultry can be safely stored in the fridge 
for only 4 days and the freezer for up to 4 months. It is, 
therefore, necessary to control meat spoilage in order to 
increase its shelf life and maintain its nutritional value, 
texture and flavor. 

[0080] In vitro produced cell-based meat, through its 
method of production and composition, produces a meat 
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product that has extended shelf life compared to conven- 
tional meat products and does not require the addition of 
preservative agents to obtain the shelf-life stability. The 
composition of cell-based meat is such that fewer off-odors 
and off-flavors are detected. In addition, the manufacturing 
methods used to produce in vitro cell-based meat require 
clean and aseptic conditions. These conditions ensure that 
microbial cell counts in both harvested products and subse- 
quent food processing are low. These multiple factors con- 
tribute to extended shelf-life stability of in vitro cell-based 
meat. 


[0081] The shelf life due to spoilage of the cell-based meat 
of the disclosure is enhanced relative to conventional meat. 
This is the case both at room temperature (about 25° C.) and 
at colder temperatures (e.g. about 4° C.). The increased shelf 
life is associated with reduced contamination, composition 
of the cell-based meat, reduced degradation of the cell-based 
meat and slower rates of change in color, spoilage, smell and 
flavor of the cell-based meat. 


[0082] Without being bound to theory or mechanism, there 
is a decrease in total fatty acid content in the cell-based meat 
prepared via the methods described herein, as compared to 
conventional meat, resulting in lower levels of fatty acid 
oxidation, leading to slower rates of change in the color, 
smell, or flavor of the meat. 


[0083] Without being bound to theory or mechanism, there 
is a decrease in the number of lipases in the cell-based meat 
prepared via the methods described herein, as compared to 
conventional meat, resulting in lower levels of fatty acid 
breakdown, leading to slower rates of change in the color, 
smell, or flavor of the meat. 


[0084] Without being bound to theory or mechanism, due 
to the absence of vasculature in the cell-based meat prepared 
via the methods described herein, when compared to con- 
ventional meat, there is less oxygen present, resulting in 
lower levels of fatty acid oxidation and the growth of aerobic 
bacteria, leading to reduced microbial contamination levels, 
and leading to slower rates of change in the color, smell, or 
flavor of the meat. 


[0085] Without being bound to theory or mechanism, due 
to the absence of functional muscle tissue (e.g., myoglobin) 
in the cell-based meat prepared via the methods described 
herein when compared to conventional meat, there is less 
oxygen present, resulting in lower levels of fatty acid 
oxidation and the growth of aerobic bacteria, leading to 
reduced microbial contamination levels, and leading to 
slower rates of change in the color, smell, or flavor of the 
meat. 


[0086] Without being bound to theory or mechanism, due 
to higher amounts of Vitamin E in the cell-based meat 
prepared via the methods described herein when compared 
to conventional meat, there are higher levels of antioxidant 
activity, resulting in protection against fatty acid oxidation, 
and leading to slower rates of change in the color, smell, or 
flavor of the meat. 


[0087] Accordingly, in some embodiments, as compared 
to conventional meat, the shelf life of cell-based meat is 
increased by at least about 1.5x, at least about 2x, at least 
about 2.5x, at least about 3x, at least about 3.5x, at least 
about 4x, at least about 4.5x, at least about 5x, at least about 
5.5x, at least about 6x, at least about 6.5x, at least about 7x, 
at least about 7.5x, at least about 8x, at least about 8.5x, at 
least about 9x, at least about 9.5x, or even at least about 10x. 
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The shelf-life increases are observed both at about 4° C., and 
about 25° C., and all temperatures in between inclusive of 
the endpoints. 

[0088] Cell-Based Products for Consumption Comprising 
Proteins from Exotic, Endangered, or Extinct Species 
[0089] In preferred embodiments of the invention, the 
cell-based products for consumption may comprise proteins 
from exotic, endangered, or extinct species. In a particular 
embodiment, the cell-based products for consumption may 
be cell-based meat products. The proteins from exotic, 
endangered, or extinct species may comprise, but are not 
limited to, myosin, actin, creatine kinase, tropomvosin, 
fibronectin, collagen, myoglobin, or keratin. The proteins 
may come from such species as mammoth, mastodon, 
saber-tooth tiger, hippopotamus, lion, dinosaur, rhinoceros, 
elephant, passenger pigeon, or shark. 

[0090] In a preferred first embodiment, DNA, RNA, or 
protein sequences from any of the above species may be 
identified by single-nucleotide polymorphisms (SNPs) and 
compared to a reference genome for an agriculturally- or 
nutritionally-relevant host animal, bacteria, or yeast. Iden- 
tified regions for DNA sequences may be coding or non- 
coding. By way of example, a protein sequence for dinosaur 
collagen may be identified and mapped to a reference 
chicken genome. Or, alternatively, nuclear or mitochondrial 
genomic sequences for woolly mammoths or mastodons 
may be identified and mapped to a reference elephant 
genome. 

[0091] In an alternative embodiment, protein may be 
expressed in microbial or yeast systems and subsequently 
purified, before being added into cell-based products for 
consumption during formulation. In a separate embodiment, 
protein may be expressed in vivo, for instance, in an embryo, 
such as a fertilized chicken egg. 

[0092] In certain embodiments, the agriculturally- or 
nutritionally-relevant host animal may have common ances- 
try with the exotic, extinct, or endangered species and may 
be familiar to human consumption. In preferred embodi- 
ments, the entity for consumption which comes from host 
animals may be beef, pork, chicken, seafood, fish, venison, 
goose, turkey, duck. Such hosts may be identified as “agri- 
culturally-relevant hosts”. With respect to common ancestry 
between host animals and exotic, extinct, or endangered 
species, by way of example, it is known that dinosaurs are 
a close common ancestor to poultry. In one instance, dino- 
saur collagen may be cloned into a chicken, squab, or turkey 
host line. Alternatively, mammals may be selected for 
expression of mammalian-derived proteins. In one instance, 
for example, proteins from an endangered red stag species 
may be expressed in an elk line. 

[0093] RNA or protein from any of the above species may 
then be back-translated into DNA sequences, which are then 
cloned in plasmids or vectors that are functional in host 
species. In certain embodiments, the plasmids or vectors 
may include a promoter DNA sequence (e.g., a cis-genic 
promoter) upstream of a cloning site designed to receive the 
back-translated DNA sequences. The plasmids or vectors 
may then be transfected by standard transfection methods 
into selected agriculturally- or nutritionally-relevant host 
animal cells for culture, such as, for example, host chicken, 
bacteria, or yeast cells, in order to enable incorporation of 
DNA sequences encoding for proteins from exotic, endan- 
gered, or extinct species into such host animal cells. In some 
embodiments, delivery of expression vectors into host cells 
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may include any suitable mechanism, including, but not 
limited to, CRISPR, Zinc Fingers, Talens, PhiC31, Piggy- 
bac, artificial chromosomes, or viral vectors. In a preferred 
further embodiment, the proteins may be muscle-derived. 
By way of example, an amino acid sequence of a skeletal 
muscle myosin protein from mammoth may be determined 
and the DNA for that protein transfected into a host cell. 
[0094] Genetic modifications, such as insertions or dele- 
tions, may then be introduced into the host genome to create 
identified unique SNPs or specific protein products. Genome 
modification may comprise any suitable mechanism includ- 
ing, but not limited to, CRISPR, Zinc Fingers, Talens, 
PhiC31, Piggybac, artificial chromosomes, or viral vectors. 
Alternatively, expression or repression of the host genome 
may be achieved by epigenetic activation or repression by 
any suitable mechanism, such as, for example, CRISPR, 
Zinc Fingers, or relevant transcription factors. Integrated 
DNA sequences from extinct, exotic, or endangered species 
may then be expressed (e.g., mRNA-transcribed and trans- 
lated into protein) by the host animal cell machinery. In a 
preferred further embodiment, the DNA sequences may be 
codon-optimized to allow for efficient protein expression in 
host cells. Transfected host cells may then be selected and 
evaluated phenotypically for cell growth, gene expression, 
protein production, or production. Host animal cells dem- 
onstrating successful transfection and DNA expression may 
then be isolated and cloned to produce cell-based products 
for consumption, as described above. A flow chart depicting 
the above process can be seen in FIG. 1. 

[0095] In a further embodiment, the host native protein 
that corresponds to that expressed from exotic, endangered, 
or extinct species may be knocked out, deleted, or repressed 
via any known genetic recombination mechanism, such as 
CRISPR or homing endonucleases. Alternatively, the host 
native protein may be left unchanged. 

[0096] In a further embodiment, the transfected animal 
cell may comprise a mix of host native protein and trans- 
fected exotic/endangered/extinct protein. The mix, for 
instance, may range from 95% host protein:5% transfected 
protein, 85% host protein:15% transfected protein, 60% host 
protein:40% transfected protein, 50% host protein:50% 
transfected protein, 40% host protein:60% transfected pro- 
tein, 15% host protein:85% transfected protein, or 5% host 
protein:95% transfected protein, or any other suitable range 
not listed above. 

[0097] Ina further embodiment, the host protein may be 
completely knocked out such that the host animal cell 
comprises 100% transfected protein. For example, in a 
particular embodiment, collagen from a dinosaur species 
may be cloned and expressed in a cultured chicken cell and 
the native chicken collagen knocked out such that the 
cultured chicken cell only produces dinosaur collagen. 
[0098] Food products may then be created from resulting, 
successfully transfected host cells. Alternatively, food prod- 
ucts may be created from expressed proteins or metabolites 
from exotic, extinct, or endangered species. By way of 
example, dinosaur collagen may be expressed in cultured 
chicken host cells to produce cell-based meat or, alterna- 
tively, purified from yeast fermentation and added to food 
products. In alternative embodiments, the dinosaur collagen 
may be employed to produce jello. 

[0099] Ina preferred second embodiment, cells from sib- 
ling extant taxa that comprise a common ancestral species 
may be fused to form multi-nucleated syncytia known as 
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heterokaryons. Cell fusion may enable modulation (i.e. 
transcriptional activation or repression) of transcriptional 
regulatory mechanisms, wherein programs for such modu- 
lators active and originating from the nuclei of one species 
modulate transcription of the other species where such 
endogenous programs are inactive or non-existent. Such 
modulators are identified herein as trans-dominant. As 
shown in FIG. 2, examples of trans-dominant modulator 
effects include transcriptional activation by trans-dominant 
gene expression programs, that induce expression of genes 
from the common ancestor that were silenced over the 
course of evolution, and, reciprocally, transcriptional repres- 
sion by trans-dominant gene expression programs, that 
silence expression of genes from the extant species that were 
activated over the course of evolution. Thus, biochemical 
features of extinct progenitors species may be resurrected in 
their extant descendant species by altering gene expression 
from the extant species genome. Such biochemical features 
may comprise proteins, adducts of proteins, polymeric sug- 
ars, post-translational modifications, metabolic intermedi- 
ates, structural RNAs, steroidal compounds, and isoprenoid 
derivatives, such as lipids, fatty acids and steroids, whose 
presence or absence in extant species may be restored to 
functional levels comparable to those found in an extinct 
common ancestor. In certain embodiments, heterokaryons 
may comprise biochemical profiles common to the original 
extant taxa cells from which they were derived. Alterna- 
tivelv, heterokarvons may comprise a unique biochemical 
profile distinct from the original extant taxa cells from which 
they were derived. Heterokaryons that successfully express 
ancestral genes may then be subjected to the above cultiva- 
tion procedures to generate cell-based products for con- 
sumption that recapitulate unique organoleptic and nutri- 
tional profiles from extinct, common ancestral species. 


[0100] In a particular aspect, as shown in FIG. 3, cells 
from extant crocodilian and avian species may be fused 
together, activating a trans-dominant transcription program 
that induces expression of silenced genes from an archo- 
saurian progenitor common to both species. The expressed 
genes may be (1) specific to the avian species and the 
archosaurian progenitor; (2) specific to the crocodilian spe- 
cies and the archosaurian progenitor; (3) specific to both the 
avian and crocodilian species as well as the archosaurian 
progenitor; or (4) specific only to the archosaurian progeni- 
tor. In alternative aspects, cells to be fused may come from 
at least two species categorized under one or more of the 
following metazoan extant metazoan categories: fish, mam- 
mals, amphibians, birds, reptiles, or invertebrates. 


[0101] Ina further embodiment, the endogenous transcrip- 
tional regulatory mechanisms involved in inducing expres- 
sion of genes from the common ancestor may be spontane- 
ous and not require genetic or epigenetic modification. 


[0102] In a further embodiment, the native cell fusion 
mechanisms from two sibling taxa comprising the hetero- 
karyon progenitor cells may be sufficient to enable cell 
fusion as a conduit for trans-dominant transcriptional modu- 
lation within the heterokaryon. In addition, native cell fusion 
mechanisms may be augmented, mediated or facilitated by 
one or more of the following: activation or overexpression 
of one or more of the following proteins: mvomaker, myo- 
mixer, minion, myomeger, dysferlin, myoferlin, syncitin-1 
or their homologs; application of polyethylene glycol; or 
application of an electric current. 
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[0103] Ina further embodiment, trans-dominant transcrip- 
tional modulation between different nuclei within the het- 
erokarvon originating from distinct species mav be sponta- 
neous and not require further alteration of the chromatin 
through the addition and or removal of epigenetic modifi- 
cations such as DNA methylation, histone methylation and 
histone acetylation to convev or augment such trans-domi- 
nant transcriptional modulation. 

[0104] Ina further embodiment, trans-dominant transcrip- 
tional modulation between different nuclei within the het- 
erokaryon originating from distinct species may be enabled 
or augmented by further alteration of the chromatin through 
the addition and or removal of epigenetic modifications such 
as DNA methylation, histone methylation and histone acety- 
lation to convey or augment such trans-dominant transcrip- 
tional modulation upon the affected chromatin. Epigenetic 
modifiers may include one or more of the following factors: 
S-azacvtidine, 5-aza-2'-deoxycytidine, trichostatin A, valp- 
roic acid, butvrate, stilbenoid activators of sirtuin homologs 
such as 3,5,4'-trihydroxy-trans-stilbene, histone deacetylase 
enzymes, histone acetyltransferase enzymes, histone meth- 
yltransferase enzymes, histone demethylase enzymes and 
even ten-eleven translocation (TET) methylcytosine dioxy- 
genase enzymes. 

[0105] Although a minimum of two nuclei from two 
different species may be required to generate a heterokaryon 
in the application of this invention, the classifications of 
nuclei are, for the purposes of this invention, not limited to 
two. For example, cells from three or more species can be 
fused into a heterokaryon wherein a more comprehensive set 
of complimentary trans-dominant transcriptional modula- 
tors alone or in combination with target genetic sequences 
for transcriptional modulation may support a biochemical 
profile more closely resembling the targeted profile. Beyond 
having nuclei from two different species, the distinctions 
introduced by the third nuclei in such heterokaryons are not 
limited to species distinction. For example, the heterokaryon 
may consist of nuclei originating from myoblasts of species 
A, nuclei originating from myoblasts of species B, and 
nuclei originating from fibroblasts of species A. Wherein it 
is expected that the genetic sequences in the species A 
myoblasts and fibroblasts would substantially be the same, 
the trans-dominant transcriptional modulators expressed by 
these two cell lineages may differ, thereby adding further 
diversity to the transcriptional control mechanisms within 
the heterokaryon. In certain embodiments, the heterokaryon 
may represent one or more tissue lineages. 

[0106] Particularly preferable embodiments of this inven- 
tion include methods of synthesizing extinct muscle derived 
tissue, comprising: selecting an extinct genome comprising 
at least a portion of a genome of an extinct species; selecting 
a foundational genome comprising at least a portion of a 
genome of a non-extinct ancestor to the extinct species; 
identifying absent genes, the absent genes comprising genes 
present in the extinct genome and absent from the founda- 
tional genome; combining the foundational genome and the 
absent genes to generate a hybridized genome; expressing 
the absent genes of the hybridized genome in a growing cell 
mass; and preparing the grown cell mass for consumption as 
a cell-based meat product. 

[0107] Inalternative embodiments, the genes in the extinct 
genome may exhibit distinct expression from sister genes in 
the foundational genome and serve as replacements or 
substitutions for the sister genes of the foundational genome. 
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[0108] In further embodiments, the extinct species may be 
a mammoth and the non-extinct ancestor may be an 
elephant. Alternatively, the extinct species may be a dino- 
saur and the non-extinct ancestor may be a chicken. 
[0109] The absent genes may comprise only genes that 
support muscle tissue growth, such as collagen or myosin 
genes. Alternatively, the absent genes may comprise actin, 
creatine kinase, tropomyosin, fibronectin, mvoglobin, or 
keratin. 

[0110] In some embodiments, the grown cell mass may 
comprise cells of the non-extinct ancestor transfected with 
the hvbridized genome. In particular examples, the cells mav 
be derived from chicken and the hvbridized genome mav be 
codon optimized for expression bv cultured chicken cells. 
[0111] In further embodiments, the foundational genes of 
the hvbridized genome mav be expressed in the growing cell 
mass. In addition, the absent genes mav be overexpressed 
using a promoter. In certain embodiments, the extinct 
genome mav comprise at least a portion of two or more 
extinct species. 

(0112) Alternative embodiments mav comprise a cell- 
based food product for consumption comprising extinct 
muscle derived tissue, the extinct muscle derived tissue 
comprising at least one gene from an extinct species and at 
least one gene from a non-extinct ancestor to the extinct 
species, wherein the at least one gene from the extinct 
species is absent from the genome of the non-extinct ances- 
tor. 

[0113] Alternatively, the at least one gene from the extinct 
species mav be substituted for a sister gene from the 
non-extinct ancestor. 

[0114] The above embodiments are by no means limiting 
and include anv combination of extinct and foundational 
genes that fall within the present disclosure. This invention 
is further illustrated bv the following additional examples 
that should not be construed as limiting. Those of skill in the 
art should, in light of the present disclosure, appreciate that 
manv changes can be made to the specific embodiments 
which are disclosed and still obtain a like or similar result 
without departing from the spirit and scope of the invention. 
[0115] All of the claims in the claim listing are herein 
incorporated by reference into the specification in their 
entireties as additional embodiments. 


EXAMPLES 


Example 1: Prediction of Repressed Gene 
Expression Profiles and Protein Expression Profiles 
in Avian-Crocodilian Homokaryons and 
Heterokaryons 


[0116] Primary Isolation of Satellite Cells from Avian and 
Crocodilian Species. 

[0117] Cells were isolated from a juvenile chicken (Gallus 
gallus) and a juvenile American alligator (Alligator missis- 
sippiensis) by the following procedure. Tissues samples 
were aseptically obtained from the pectoralis muscle group 
from sub-adult specimens of each species using methodolo- 
gies generally known in the art to support the isolation of 
satellite cells from skeletal muscle tissues originating from 
a plurality of vertebrate species. The obtained tissue was 
enzymatically dissociated to release satellite cells from the 
skeletal muscle explants. Tissue debris was separated from 
the isolated cells by filtration through a fine mesh. Cells 
were separated from the enzymatic digest solution by cen- 
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trifugation and resuspended in a growth medium consisting 
of a Dulbecco’s Modified Eagle Medium containing a 
bicarbonate buffering system and supplemented with gluta- 
mine, ungulate serum, chick embryo extract and an antibi- 
otic/antimycotic solution. 


Cultivation of Crocodilian and Avian Myoblasts 


[0118] Satellite cells isolated from the avian and croco- 
dilian tissues were seeded onto tissue-culture grade poly- 
styrene culture dishes in the growth medium onto surfaces 
previously coated with gelatin. Culture dishes seeded with 
crocodilian cells were incubated at temperatures ranging 
from 86-93° C., whereas cells isolated from avian tissue 
were incubated at 37° C. From this point, proliferating 
progeny of the isolated satellite cells are identified as 
myoblasts. All cultures were incubated under 5% atmo- 
spheric carbon dioxide culture conditions. The temperature 
condition supporting the most robust cell outgrowth of the 
crocodilian myoblasts was first used to expand the popula- 
tion of crocodilian myoblasts. Next, cells were passaged 
from this condition to otherwise equivalent conditions rang- 
ing in temperature from the starting condition supporting the 
most robust cell growth to 37° C. The temperature closest to 
37° C. capable of supporting a cell population crocodilian 
cell doubling rate below forty-eight hours was selected to 
co-culture avian and crocodilian myoblasts. 


Co-Culture and Fusion of Crocodilian and Avian Myoblasts 


[0119] Avian cultures and selected crocodilian cultures 
were enzymatically dissociated and seeded onto gelatin- 
coated tissue culture grade plastic dishes in growth medium 
at 15,000 cells/cm? for each cell population, for a total 
seeding density of 30,000 cells/cm?. The co-culture was 
returned to the incubator setting for the selected crocodilian 
cultures. Once this co-culture reached approximately 80% 
confluence, ungulate serum in the growth medium was 
reduced and ungulate insulin-like growth-factor was added 
to support myoblast fusion within the co-culture and forma- 
tion of myotube heterokaryons consisting of both crocodil- 
ian and avian nuclei. Incubation of the co-cultures was 
continued until myotubes in the culture dish reached their 
maximum size, at which point cellular biomass is collected 
for processing for downstream analytics or as a food ingre- 
dient or product. 

[0120] Identification of Repressed Avian and Crocodilian 
Gene Expression Programs Activated in Crocodilian-Avian 
Cell Heterokaryons 

[0121] The defining aspects of this innovation include (1) 
methods for activating expression of repressed gene expres- 
sion programs in Avian and Crocodilian species as described 
in an exemplary manner in this example, and (2) protein and 
lipid profile constituency subsequent to activated gene 
expression profiles in Avian and Crocodilian heterokaryons 
not found in either the Avian or Crocodilian gene expression 
programs of the contributing myoblasts from the respective 
constituent species represented within the co-cultures pre- 
viously described within this example. 

[0122] In a separate experiment, avian or crocodilian 
myoblasts were seeded and differentiated into myotubes as 
described in the procedures for establishing avian-crocodil- 
ian heterokaryons, except that both avian and crocodilian 
myoblasts were seeded at 30,000 cells per cm? and directed 
to fuse and form myotubes as species-exclusive monocul- 
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tures, resulting in homokaryons. Once the homokaryon 
myotubes reached their maximum size, their cellular bio- 
mass was collected and processed for downstream analytics. 
[0123] Relevant downstream analytics for determining the 
gene expression profiles and predicting the protein expres- 
sion profiles of the heterokaryon and homokaryon cultures 
include RNA-seq, and variations of mass spectroscopy such 
as LC-MS to determine lipid profiles. Activated gene 
expression profiles present in heterokaryon cultures but 
repressed in either homokaryon cultures will be determined 
by subtracting orthologous gene expression common to both 
homo- and heterokaryon cultures from heterokaryon gene 
expression profiles. The remaining heterokaryon gene 
expression profile will be classified and heterokaryon-trans- 
activated gene expression predictive of gene-expression 
programs expressed in the archosaurian ancestors of avian 
and crocodilian species no longer expressed in chickens and 
alligators. Expression of enzymatic proteins of lipid biosyn- 
thetic pathways arising from heterokaryon-transactivated 
gene expression in conjunction with metabolites are pre- 
dicted to result in the biosynthesis of cellular lipid molecules 
found neither in the crocodilian nor the avian lipid profiles 
displayed within their respective homokaryons. These lipid 
profiles define the major component of the flavor profile of 
meat in a species-specific manner in foods prepared from 
tissues of the respective species. Therefore, the organoleptic 
phenotype of heterokaryon-activated lipid biosynthetic path- 
way profiles is predicted to impart within foods prepared 
from the respective heterokaryon biomass authentic flavor 
notes of the extinct archosaurian ancestors of avian and 
crocodilian species no longer present in alligator or chicken 
muscle. 


Example 2: Expression of Dinosaur Collagen 1A1 
and 1A2 in Cultured Chicken Cells 


[0124] This experiment proposes to introduce ancient 
DNA coding for Tyrannosaurus Rex and Brachylophosaurus 
canadensis collagen into cultured chicken cells for the 
purpose of food product development. Specifically, extinct 
collagen genes from extinct Tyrannosaurus Rex and Brachy- 
lophosaurus canadensis species were mapped against foun- 
dational genes from chickens, a non-extinct ancestor to 
Tyrannosaurus Rex and Brachylophosaurus canadensis, to 
identifv collagen genes present in the extinct dinosaur 
species and absent in chickens. The dinosaur-specific col- 
lagen genes were then combined with the chicken genes and 
expressed in a growing culture of chicken cells. The mass of 
dinosaur collagen-expressing cultured chicken cells was 
grown to generate a cell-based meat product for consump- 
tion. 

[0125] As shown in FIGS. 4A-1-4A-4 and 4B-1-4B-4, 
Tyrannosaurus Rex and Brachylophosaurus canadensis col- 
lagen I, alpha I (COL 1A1) and collagen I, alpha 2 (COL 
1A2) sequences were mapped to reference chicken COL 
1A1 and COL 1A2 proteins to identify any amino acid 
sequence divergences that exist between the species. Rela- 
tive to chicken COL 1A1, amino acid substitutions P-406-A 
and S-407-P were found to be unique to Tyrannosaurus Rex, 
and amino acid substitutions P-776-S and A-779-P were 
found to be unique to Brachylophosaurus canadensis. Rela- 
tive to chicken COL 1A2, amino acid substitution I-605-P 
was found to be unique to Tyrannosaurus Rex, and amino 
acid substitution A-311-T was found to be unique to Brachv- 
lophosaurus canadensis. 
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[0126] As shown in FIGS. SA and 5B, the dinosaur- 
specific COL1A1 (P-406-A, S-407-P, P-776-S and A-779-P) 
amino acid substitutions were inserted into the chicken 
COLIAI reference protein sequence, chicken codon opti- 
mized, and designated Dinosaur COL1A1. The dinosaur- 
specific COL1A2 (l-605-P, A-311-T) amino acid substitu- 
tions were inserted into the chicken COL1A2 reference 
protein sequence, chicken codon optimized, and designated 
Dinosaur COL1A2. The Dinosaur COL1A1 gene sequence 
was chicken codon optimized and cloned into a PhiC31 
vector under the control of an EF la promoter. The Dinosaur 
COL1A2 gene sequence was codon optimized and cloned 
into a PhiC31 vector under control of a PGK promoter. The 
Dinosaur COL1A1 and 1A2 vectors were used to develop 
stable expression in cultured chicken cells for cultured meat 
while not reducing expression of any chicken-specific genes. 
[0127] As shown in FIG. 6, cultured chicken cells were 
transfected with the Dinosaur COL1A1 and 1A2 vectors. 
Dinosaur collagen 1A1 and 1A2 gene vectors or control 
empty vectors were transfected with integrase into cultured 
chicken cells. A naive sample served as an additional con- 
trol. As demonstrated by red fluorescent and green fluores- 
cent staining in both the empty vector and dinosaur-specific 
panels, transfection of cultured chicken cells was successful. 
FIGS. 7A and 7B serve as proof of concept that this 
transfection included dinosaur-specific collagen 1A1 and 
1A2. Dinosaur collagen 1A1 and 1A2 levels were clearly 
significantly enhanced relative to naive controls, indicating 
that genes for the specific dinosaur proteins were success- 
fully inserted into cultured chicken cells and expressed. And 
FIG. 8 demonstrates that expression of the Dinosaur colla- 
gen 1A1 and 1A2 correlates with expression of collagen, 
illustrating that the dinosaur-specific collagen proteins map 
to chicken-specific collagen. These findings demonstrate 
that dinosaur-specific collagen proteins can be successfully 
expressed in cultured chicken cells. 


Example 3: Expression of Mammoth Myosin 
Heavy Chain MYH13 in Cultured Chicken Cells 


[0128] This experiment proposes to introduce ancient 
DNA coding for Mammoth-specific myosin heavy chain 
protein into cultured chicken cells for the purpose of food 
product development. Specifically, extinct myosin genes 
from an extinct Woolly Mammoth species were mapped 
against foundational genes from non-extinct elephants, a 
non-extinct ancestor to Woolly Mammoth, to identify myo- 
sin genes in Woolly Mammoth that exhibit differential 
expression than their sister counterpart in elephants. The 
differentially expressed Woolly Mammoth-specific myosin 
genes were then substituted for the sister elephant genes and 
expressed in a growing culture of chicken cells. The mass of 
Woolly Mammoth myosin-expressing cultured chicken cells 
was grown to generate a cell-based meat product for con- 
sumption. 

[0129] Mammoth and elephant genomes were analyzed 
for protein coding non-synonymous single nucleotide poly- 
morphisms (SNPs) that only exist in Woolly Mammoth 
Myosin genes. Elephant genomes examined included Afri- 
can elephant (L_africana_B, L_africana_C, LoxAfr 3.0), 
Asian elephant (E_maximus_D, E_maximus_E), Forest 
Elephant (L_cyclotis_A, L_cyclotis_F), Straight tusked 
elephant (P antiquus N, P antiquus O). Mammoth 
genomes included in the analysis were Woolly Mammoth 
(Wrangle, Oimyakon, M primigenius G, M_primigenius_ 
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H, M_primigenius_S, M_primigenius_V) and Columbian 
Mammoth (M_columbi_U). As shown in FIGS. 9A-9D, 
mammoth specific protein coding SNPs were identified in 
myosin heavy chain gene MYH13 to include T-1306-S, 
N-1668-D, A-1914-V. These amino acid substitutions did 
not exist in any elephant genomes analyzed. The identified 
amino acid substitutions were applied to the Loxodonta 
africana MYH13 sequence. As shown in FIGS. 10A-10B, 
the mammoth myosin heavy chain gene (MYH13) contain- 
ing amino acids 1914-V, 1668-D, and 1306-S was created by 
mapping the amino acid substitutions to the Loxodonta 
africana reference genome. The nucleotide sequence was 
chicken codon optimized for expression in cultured chicken 
cells. 


(0130) As shown in FIG. 11, the mammoth-specific 
MYH13 containing 1914-V, 1668-D, and 1306-S amino 
acids was encoded in the Loxodonta africana MYH13 
reference sequence. The nucleotide sequence was chicken 
codon optimized and cloned into a PhiC31-RFP vector for 
stable expression in cultured chicken cells developed for 
cultured meat. 


SEQUENCE LISTING 


<160> NUMBER OF SEQ ID NOS: 16 
<210> 
<211> 
<212> 
<213> 
<220> 


<223> 


SEO ID NO 1 

LENGTH: 1363 

TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: Description of 
polypeptide 


<400> SEQUENCE: 1 


Met Leu Ser Phe 
1 5 


Thr Ile Leu Leu Leu Leu 


10 


Asp Arg 


Ala 
20 


Gln Val 


25 


Tyr Leu Ser His Ser Glu Ala Ser 


Pro Pro Gln Glu 


35 


Gly Arg Gly Asp Lys Gly Arg 


45 
Gly Pro 
60 


Pro Pro Pro 


50 


Gly Arg Asp Gly 


55 


Asp Gly Gly 


Phe 
75 


Pro Pro Pro Pro Leu Asn Ala Ala 


65 


Gly Gly 


70 


Gly Gly 


Ala Ala 


85 


Pro Ser Lys Asp Phe Gly Pro Gly Pro Met 


90 


Gly 


Pro Pro Ala Ser Pro Pro Pro 


100 


Pro Arg Gly Gly Gly 


105 


Gly 


Gln Val 


115 


Glu Glu Gln Thr 


125 


Gly Pro Gly Pro Gly Pro 


120 


Gly 


Ala 
140 


Gly Pro Pro Pro Pro Pro 


130 


Arg Gly Gly 


135 


Gly Lys Gly 


His Pro Pro Pro Glu Val 


145 


Gly Lys Gly 


150 


Arg Gly Arg 


155 


Gly 


Gln Ala Phe Thr Pro Leu Pro 


170 


Gly Arg Gly Pro 


165 


Gly Gly 


Gly Ile Arg Gly His Asn Gly Leu Asp Gly Gln Lys Gly 


[0131] As shown in FIG. 12, cultured chicken cells were 
transfected with the Mammoth MYH13 gene vector. A 
Mammoth MYH13 gene gene vector or control empty 
vector were transfected with integrase into cultured chicken 
cells. A naive sample served as an additional control. As 
demonstrated by red fluorescent staining in both the empty 
vector and mammoth-specific panels, transfection of cul- 
tured chicken cells was successful. FIG. 13 serves as proof 
of concept that this transfection included mammoth-specific 
MYH13. Mammoth MYH13 levels were clearly signifi- 
cantly enhanced relative to a naive control, indicating that 
the gene for the specific mammoth protein was successfully 
inserted into cultured chicken cells and expressed. And FIG. 
14 demonstrates that expression of the Mammoth MYH13 
correlates with expression of myosin, illustrating that the 
mammoth-specific myosin proteins map to chicken-specific 
myosin. These findings demonstrate that mammoth-specific 
myosin proteins can be successfully expressed in cultured 
chicken cells. 


REFERENCES 


[0132] All references referred to above are incorporated 
herein by reference in their entireties. 


Artificial Sequence: Synthetic 


Ala Val 


15 


Thr 


Ala 
30 


Gly Arg 


Gly Pro Pro 


Pro Pro Gly 


Gln Tyr Asp 


80 


Met 
95 


Leu Gly 


Pro Phe 


110 


Gly 


Gly Pro Gln 


Glu Asp Gly 


Ala Pro 


160 


Gly 


Phe 
175 


Gly Lys 


Gln Pro Gly 
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-continued 
180 185 190 


Thr Pro Gly Thr Lys Gly Glu Pro Gly Ala Pro Gly Glu Asn Gly Thr 
195 200 205 


Pro Gly Gln Pro Gly Ala Arg Gly Leu Pro Gly Glu Arg Gly Arg Ile 
210 215 220 


Glv Ala Pro Gly Pro Ala Gly Ala Arg Gly Ser Asp Gly Ser Ala Gly 
225 230 235 240 


Pro Thr Gly Pro Ala Gly Pro Ile Gly Ala Ala Gly Pro Pro Gly Phe 
245 250 255 


Pro Gly Ala Pro Gly Ala Lys Gly Glu Ile Gly Pro Ala Gly Asn Val 
260 265 270 


Gly Pro Thr Gly Pro Ala Gly Pro Arg Gly Glu Ile Gly Leu Pro Gly 
275 280 285 


Ser Ser Gly Pro Val Gly Pro Pro Gly Asn Pro Gly Ala Asn Gly Leu 
290 295 300 


Pro Gly Ala Lys Gly Ala Thr Gly Leu Pro Gly Val Ala Gly Ala Pro 
305 310 315 320 


Gly Leu Pro Gly Pro Arg Gly Ile Pro Gly Pro Pro Gly Pro Ala Gly 
325 330 335 


Pro Ser Gly Ala Arg Gly Leu Val Gly Glu Pro Gly Pro Ala Gly Ala 
340 345 350 


Lys Gly Glu Ser Gly Asn Lys Gly Glu Pro Gly Ala Ala Gly Pro Pro 
355 360 365 


Gly Pro Pro Gly Pro Ser Gly Glu Glu Gly Lys Arg Gly Ser Asn Gly 
370 375 380 


Glu Pro Gly Ser Ala Gly Pro Pro Gly Pro Ala Gly Leu Arg Gly Val 
385 390 395 400 


Pro Gly Ser Arg Gly Leu Pro Gly Ala Asp Gly Arg Ala Gly Val Met 
405 410 415 


Gly Pro Ala Gly Asn Arg Gly Ala Ser Gly Pro Val Gly Ala Lys Gly 
420 425 430 


Pro Asn Gly Asp Ala Gly Arg Pro Gly Glu Pro Gly Leu Met Gly Pro 
435 440 445 


Arg Gly Leu Pro Gly Gln Pro Gly Ser Pro Gly Pro Ala Gly Lys Glu 
450 455 460 


Gly Pro Val Gly Phe Pro Gly Ala Asp Gly Arg Val Gly Pro Ile Gly 
465 470 475 480 


Pro Ala Gly Asn Arg Gly Glu Pro Gly Asn Ile Gly Phe Pro Gly Pro 
485 490 495 


Lys Gly Pro Thr Gly Glu Pro Gly Lys Pro Gly Glu Lys Gly Asn Val 
500 505 510 


Gly Leu Ala Gly Pro Arg Gly Ala Pro Gly Pro Glu Gly Asn Asn Gly 
515 520 525 


Ala Gln Gly Pro Pro Gly Val Thr Gly Asn Gln Gly Ala Lvs Gly Glu 
530 535 540 


Thr Gly Pro Ala Gly Pro Pro Gly Phe Gln Gly Leu Pro Gly Pro Ser 
545 550 555 560 


Gly Pro Ala Gly Glu Ala Gly Lvs Pro Gly Glu Arg Gly Leu His Gly 
565 570 575 


Glu Phe Gly Val Pro Gly Pro Ala Gly Pro Arg Gly Glu Arg Gly Leu 
580 585 590 
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-continued 
Pro Gly Glu Ser Gly Ala Val Gly Pro Ala Gly Pro Pro Gly Ser Arg 
595 600 605 


Gly Pro Ser Gly Pro Pro Gly Pro Asp Gly Asn Lys Gly Glu Pro Gly 
610 615 620 


Asn Val Gly Pro Ala Gly Ala Pro Gly Pro Ala Gly Pro Gly Gly Ile 
625 630 635 640 


Pro Gly Glu Arg Gly Val Ala Gly Val Pro Gly Gly Lys Gly Glu Lys 
645 650 655 


Glv Ala Pro Gly Leu Arg Gly Asp Thr Gly Ala Thr Gly Arg Asp Gly 
660 665 670 


Ala Arg Gly Leu Pro Gly Ala Ile Gly Ala Pro Gly Pro Ala Gly Gly 
675 680 685 


Ala Gly Asp Arg Gly Glu Gly Gly Pro Ala Gly Pro Ala Gly Pro Ala 
690 695 700 


Gly Ala Arg Gly Ile Pro Gly Glu Arg Gly Glu Pro Gly Pro Val Gly 
705 710 715 720 


Pro Ser Gly Phe Ala Gly Pro Pro Gly Ala Ala Gly Gln Pro Gly Ala 
725 730 735 


Lys Gly Glu Arg Gly Pro Lys Gly Pro Lys Gly Glu Thr Gly Pro Thr 
740 745 750 


Gly Ala Ile Gly Pro Ile Gly Ala Ser Gly Pro Pro Gly Pro Val Gly 
755 760 765 


Ala Ala Gly Pro Ala Gly Pro Arg Gly Asp Ala Gly Pro Pro Gly Met 
770 775 780 


Thr Gly Phe Pro Gly Ala Ala Gly Arg Val Gly Pro Pro Gly Pro Ala 
785 790 795 800 


Gly Ile Thr Gly Pro Pro Gly Pro Pro Gly Pro Ala Gly Lys Asp Gly 
805 810 815 


Pro Arg Gly Leu Arg Gly Asp Val Gly Pro Val Gly Arg Thr Gly Glu 
820 825 830 


Gln Gly Ile Ala Gly Pro Pro Gly Phe Ala Gly Glu Lys Gly Pro Ser 
835 840 845 


Gly Glu Ala Gly Ala Ala Gly Pro Pro Gly Thr Pro Gly Pro Gln Gly 
850 855 860 


Ile Leu Gly Ala Pro Gly Ile Leu Gly Leu Pro Gly Ser Arg Gly Glu 
865 870 875 880 


Arg Gly Leu Pro Gly Ile Ala Gly Ala Thr Gly Glu Pro Gly Pro Leu 
885 890 895 


Gly Val Ser Gly Pro Pro Gly Ala Arg Gly Pro Ser Gly Pro Val Gly 
900 905 910 


Ser Pro Gly Pro Asn Gly Ala Pro Gly Glu Ala Gly Arg Asp Gly Asn 
915 920 925 


Pro Gly Asn Asp Gly Pro Pro Gly Arg Asp Gly Ala Pro Gly Phe Lys 
930 935 940 


Gly Glu Arg Gly Ala Pro Gly Asn Pro Gly Pro Ser Gly Ala Leu Gly 
945 950 955 960 


Ala Pro Gly Pro His Gly Gln Val Gly Pro Ser Gly Lys Pro Gly Asn 
965 970 975 


Arg Gly Asp Pro Gly Pro Val Gly Pro Val Gly Pro Ala Gly Ala Phe 
980 985 990 
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-continued 


Gly Pro Arg Gly Leu Ala Gly Pro Gln Gly Pro Arg Gly Glu Lys Gly 
995 1000 1005 


Glu Pro Gly Asp Lys Gly His Arg Gly Leu Pro Gly Leu Lys Gly 
010 015 1020 


His Asn Gly Leu Gln Gly Leu Pro Gly Leu Ala Gly Gln His Gly 
025 030 1035 


Asp Gln Gly Pro Pro Gly Asn Asn Gly Pro Ala Gly Pro Arg Gly 
040 045 1050 


Pro Pro Gly Pro Ser Gly Pro Pro Gly Lys Asp Gly Arg Asn Gly 
055 060 1065 


Leu Pro Giv Pro Ile Gly Pro Ala Gly Val Arg Gly Ser His Gly 
070 075 1080 


Ser Gln Gly Pro Ala Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly 
085 090 1095 


Pro Pro Gly Pro Asn Gly Gly Gly Tyr Glu Val Gly Phe Asp Ala 
100 105 1110 


Glu Tyr Tyr Arg Ala Asp Gln Pro Ser Leu Arg Pro Lys Asp Tyr 
1115 1120 1125 


Glu Val Asp Ala Thr Leu Lys Thr Leu Asn Asn Gln Ile Glu Thr 
1130 1135 1140 


Leu Leu Thr Pro Glu Gly Ser Lys Lys Asn Pro Ala Arg Thr Cys 
1145 1150 1155 


Arg Asp Leu Arg Leu Ser His Pro Glu Trp Ser Ser Gly Phe Tyr 
1160 1165 1170 


Trp Ile Asp Pro Asn Gln Gly Cys Thr Ala Asp Ala Ile Arg Ala 
1175 1180 1185 


Tyr Cys Asp Phe Ala Thr Gly Glu Thr Cys Ile His Ala Ser Leu 
1190 1195 1200 


Glu Asp Ile Pro Thr Lys Thr Trp Tyr Val Ser Lys Asn Pro Lys 
1205 1210 1215 


Asp Lys Lys His Ile Trp Phe Gly Glu Thr Ile Asn Gly Gly Thr 
1220 1225 1230 


Gln Phe Glu Tyr Asn Gly Glu Gly Val Thr Thr Lys Asp Met Ala 
1235 1240 1245 


Thr Gln Leu Ala Phe Met Arg Leu Leu Ala Asn His Ala Ser Gln 
1250 1255 1260 


Asn Ile Thr Tyr His Cys Lys Asn Ser Ile Ala Tyr Met Asp Glu 
1265 1270 1275 


Glu Thr Gly Asn Leu Lys Lys Ala Val Ile Leu Gln Gly Ser Asn 
280 1285 1290 


Asp Val Glu Leu Arg Ala Glu Gly Asn Ser Arg Phe Thr Phe Ser 
295 1300 1305 


Val Leu Val Asp Gly Cys Ser Lys Lys Asn Asn Lys Trp Gly Lys 
310 1315 1320 


Thr Ile Ile Glu Tyr Arg Thr Asn Lys Pro Ser Arg Leu Pro Ile 
325 1330 1335 


Leu Asp Ile Ala Pro Leu Asp Ile Gly Gly Ala Asp Gln Glu Phe 
340 1345 1350 


Gly Leu His Ile Gly Pro Val Cys Phe Lys 
355 1360 
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<210> SEQ ID NO 2 
<211> LENGTH: 4362 


<212> TYPE: 


DNA 


<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 
<223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

polynucleotide 


<400> SEQUENCE: 2 


atgttcaget 
agaggccaag 
aacgacaagg 
atcctctgcg 
ttcggcgagt 
gctggagtag 
ccacccggca 
ggcccaccog 
agtgctggag 
ccgcccggcg 
ggtgcgagtg 
gacggggaag 
getegeggec 
ggactggatg 
cccggcgaga 
cgecctggcc 
ggacctcccg 
aagggggaaa 
gagcccggcc 
ggtcaacctg 
cccggggcac 
aatagtggag 
ggccccgccg 
cgtggtgaac 
agcagaggct 
ggttctcccg 
cceggettge 


aaaaccggcc 


ggggcaaggg 


cctgggaagc 


aaagacggtg 


ggtgaacaag 


cccggcgagg 


tcgtggacag 
gegaagagga 
acgtgtggaa 
acgaggtgat 
gctgtcccat 
agggtcctaa 
gagacggaat 
gactgggggg 
ttgccgtccc 
cacccggccc 
gtccaatggg 
ccggcaagcc 
tgcccggcac 
gtgcaaaagg 
atggagcccc 
cttctggtcc 
gtcctactgg 
ccggccctca 
cccccggccc 
gcgccaaagg 
gcggtgcgcc 
agcccggcgc 
gagtgcaagg 
ccgggccggc 
ttcccggcgc 
gegeegtegg 
ccggcgctaa 


coccgggtec 


gecaagecgg 


ccggtgagag 


aggccggagc 


gcccggccgg 


cgggcaaacc 


cagactgctg 
cattcagacc 
gcccgagccc 
ctgcgaggac 
ctgccccgac 
gggcgacacg 
acccggccaa 
gaacttcgcc 
cggcccgatg 
gcaaggtttt 
accgcgaggg 
tggccgtccc 
ggccggtttg 
agaaccggga 
gggacagatg 
cgccggtgct 
accggccggc 
aggagcaagg 
cgccggtgcc 
tgcaaccgga 
cggccctcaa 
gcccggcaat 
accgccgggt 
gggtctgccg 
agatggtatt 
tccgaaagga 
aggtttgacc 


cgccggacaa 


ggttatggga 


aggtgctccc 


gcaaggccca 


tgctcccgga 


tggggaacaa 


ctgctgatcg 
ggcagctgcg 
tgtcagatct 
acaagcgact 
gtagatgcta 
ggaccccggg 
cccggtctac 
ccacagatga 
ggccccgccg 
caaggccctc 
cccgccggcc 
ggggagaggg 
cccggcatga 
ceggegggac 
ggcccccgag 
cgtgggaatg 
ccaccgggct 
ggttccgagg 
gegggaccgg 
gctccgggca 
ggaccaagcg 
aaaggcgata 
cctgctggcg 
gggccggctg 
gcgggtccaa 
agtcctggtg 
ggctcgcccg 
gacggtcggc 
tttcctgggc 
ggcccgcccg 


ccgggaccaa 


tttcaaggcc 


ggcgttccog 


16 


-continued 


ccgccaccgt 
tgcaagacgg 
gcgtgtgcga 
gccccaacgc 
gccccgtgta 
gggatcgtgg 
ctggccctcc 
gctatgggta 
geccacgggg 
ccggcgagcc 
cacccggaaa 
gcccccccgg 
agggccatag 
cgaaggggga 
gtctccccgg 
atggggcgcc 
ttcctggggc 
gtccccaagg 
ccgggaatcc 
tagctggtgc 
gcgctcctgg 
ccggcgccaa 
aagaaggcaa 
gtgagagagg 
aaggacctcc 
aggctggaag 
gctcacctgg 


ccggcccgcc 


caaagggcgc 


gcgcagtggg 


ccgggcctgc 


tgcccggccc 


gagacgccgg 


gctgctgaca 
tctgacttac 
cagcggtaac 
cgagatcccc 
ccccgagtcg 
acttcccggc 
tggccccccg 
cgacgagaaa 
tetaccegge 
cggcgaaccg 
gaatggcgac 
cccccaaggc 
aggtttcagc 
acccgggtcc 
cgaacgagga 
eggegetget 
cgecggtgca 
tgcgaggggt 
cggggcagac 
tcccggcttc 
tcctaaggga 
aggggagccc 
acgcggcgct 
agcgcccggc 
cggtgagagg 
gcceggtgaa 
acctgatggt 


tgggectcet 


agccggcgaa 


agctgccggc 


cggcgagcgc 


tgegggecec 


tgcccccggg 


60 


120 


180 


240 


300 


360 


420 


480 


540 


600 


660 


720 


780 


840 


900 


960 


1020 


1080 


1140 


1200 


1260 


1320 


1380 


1440 


1500 


1560 


1620 


1680 


1740 


1800 


1860 


1920 


1980 
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-continued 
ccggctggag cgaggggtga acgtggcttt ccgggggaac gaggtgtgca aggccctccc 2040 
ggcccccaag gcccccgtgg cgccaatgga gcccccggca atgatggggc taaaggcgac 2100 
gctggggcgc ceggagctcc cggcaaccaa ggaccgcctg gattacaagg gatgcctggg 2160 
gaacgcggeg ccgccggcct tcccggcgcg aaaggcgata ggggcgaccc cggacccaag 2220 
ggtgccgacg gggctcccgg caaggatggc ctacggggge tgactgggcc aattgggccc 2280 
cctggeccgg ccggegcacc cggcgacaag ggtgaagccg gcccgagcgg ccctccgggc 2340 
cccaccggcg cccgaggagc tcccggcgat cgtggcgaac ccggccctcc cggccccgcc 2400 
ggatttgctg gcccgccggg tgcagacggc caacctggcg ctaagggcga aaccggcgac 2460 
gegggegcaa aaggcgatgc tggcccaccc ggccctgccg gacctacggg ggctcccggc 2520 
cctgctgggg cggtgggtgc tcctgggccg aagggagcga gaggctcagc cgggcccccc 2580 
ggcgcgacgg gtttccctgg agcggccggc cgtgtcgggc cccccggtcc ttctggtaat 2640 
ataggcttac ccggcccgcc cggtccaagt ggaaaggagg ggggtaaagg gccgagaggc 2700 
gagactggcc ccgcggggag gcccggcgag ccgggacccg ctgggccacc tggtccccct 2760 
ggegaaaagg gcagccccgg cgccgacggg cccataggag cccccggcac ccctggtccc 2820 
caaggtatag ccggacagag gggggtggtc ggactccccg gtcaacgegg tgagagagga 2880 
tttccgggcc tgecegggec aagcggagaa cccggaaaac aaggtccgag cggctcgccg 2940 
ggggagcgtg gtcctcctgg accaatgggt ccgcctggtc ttgccggccc ccccggagaa 3000 
gegggcagag agggagcacc gggggccgag ggcgcccccg gccgtgacgg cgcagccggg 3060 
cctaagggcg atcgagggga aaccggccce gecggaccac ccggcgcgcc cggcgcgccc 3120 
ggtgecectg ggecegttgg gcctgccggg aaaaacggtg accggggcga aaccgggecg 3180 
gecgggectg cgggaccacc tgggcctgct ggagcacgag gacccgccgg accacaaggc 3240 
cegegeggtg ataaaggtga gaccggcgag caaggcgaca gaggcatgaa aggccataga 3300 
ggctttagcg gcctacaagg cccgcccggc cctcccggcg ccccgggtga acaagggccc 3360 
tctggcgcat ctggacctgc cggcccacga ggcccgcctg gcagcgccgg ggcagctgga 3420 
aaggacggtc ttaatggctt gcctgggcca ataggtccac ccggtccaag aggtaggacc 3480 
ggegaggtag gtcctgtcgg tccgcccggg ccgcccggtc cccccggccc gcctggcccc 3540 
cectceggeg gtttcgactt cagcttcctg ccacagccce cccaagagaa ggcccacgac 3600 
ggcggcagat actacagagc cgacgacgcc aacgtgatga gagacagaga cctggaggtg 3660 
gacaccaccc tgaagagcct gtctcagcag atcgagaaca tcagaagccc cgagggcaca 3720 
agaaagaacc ccgctagaac ctgcagagac ctgaagatgt gccacggcga ctggaagagc 3780 
ggegagtact ggatcgaccc caaccaagge tgcaacctgg acgccatcaa ggtgtactgc 3840 
aacatggaga cgggagagac atgcgtgtac cccacccaag ccaccatcgc tcagaagaac 3900 
tggtatctga gcaagaaccc caaggagaag aagcacgtgt ggttcggcga gaccatgagc 3960 
gacggctttc agttcgagta cggcggcgag ggcagcaacc ccgccgacgt ggccattcag 4020 
ctgaccttcc tgagactgat gagcaccgag gccacacaga acgtgaccta ccactgcaag 4080 
aacagcgtgg cctacatgga ccacgacacc ggcaacctga agaaggccct gctgctgcaa 4140 
gġegccaacg agatcgagat cagagccgag ggcaacagca gattcaccta cggcgtgacc 4200 


gaggacggct gcacaagcca caccggcgcc tggggcaaga ccgtgatcga gtacaagacc 4260 
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-continued 


accaagacaa gcagactgcc catcatcgac ctggccccca tggacgtggg tgctcctgat 4320 


caagagttcg gcatcgacat cggccccgtg tgcttcctgt ga 


<210> 
<211> 
<212> 
<213> 
<220> 
<223> 


PRT 


SEQ ID NO 3 
LENGTH: 
TYPE: 
ORGANISM: Artificial Sequence 
FEATURE: 
OTHER INFORMATION: Description of 


1453 


polypeptide 


<400> SEQUENCE: 


Met 


i 


Cys 


Glu 


Glu 


65 


Phe 


Tyr 


Arg 


Gly 


Leu 


145 


Ser 


Gly 


Pro 


Arg 


Gly 


225 


Ala 


Arg 


Gly 


Gin 


Ser 


305 


Gly 


Phe 


Leu 


Val 


Pro 


50 


Val 


Gly 


Pro 


Gly 


Gln 


130 


Gly 


Ala 


Leu 


Pro 


Gly 


210 


Lys 


Arg 


Gly 


Pro 


Met 
290 


Gly 


Pro 


Ser 


Leu 


Gln 


35 


Cvs 


Ile 


Glu 


Glu 


Asp 


115 


Pro 


Gly 


Gly 


Pro 


Gly 


195 


Pro 


Pro 


Gly 


Phe 


Lys 
275 
Gly 


Pro 


Pro 


Phe 


Thr 


20 


Asp 


Gln 


Cvs 


Cvs 


Ser 


100 


Arg 


Gly 


Asn 


Val 


Gly 


180 


Glu 


Ala 


Gly 


Leu 


Ser 


260 


Gly 


Pro 


Ala 


Gly 


3 


Gly 


Ile 


Glu 


Cvs 


85 


Ala 


Gly 


Leu 


Phe 


Ala 


165 


Pro 


Pro 


Gly 


Arg 


Pro 


245 


Gly 


Glu 


Arg 


Gly 


Pro 


Asp 


Gly 


Leu 


Cvs 


Asp 


70 


Pro 


Gly 


Leu 


Pro 


Ala 


150 


Val 


Pro 


Gly 


Pro 


Pro 


230 


Gly 


Leu 


Pro 


Gly 


Ala 
310 


Thr 


Ser 


Gln 


Thr 


val 


55 


Thr 


Tle 


val 


Pro 


Gly 


135 


Pro 


Pro 


Gly 


Glu 


Pro 


215 


Gly 


Thr 


Asp 


Gly 


Leu 
295 


Arg 


Gly 


Arg 


Gly 


Tyr 


40 


Cys 


Ser 


Cys 


Glu 


Gly 


120 


Pro 


Gln 


Gly 


Ala 


Pro 


200 


Gly 


Glu 


Ala 


Gly 


Ser 


280 


Pro 


Gly 


Pro 


Leu 


Glu 


25 


Asn 


Asp 


Asp 


Pro 


Gly 


105 


Pro 


Pro 


Met 


Pro 


Pro 


185 


Gly 


Lys 


Arg 


Gly 


Ala 


265 


Pro 


Gly 


Asn 


Ala 


Leu 


10 


Glu 


Asp 


Ser 


Cys 


Asp 


90 


Pro 


Pro 


Gly 


Ser 


Met 


170 


Gly 


Ala 


Asn 


Gly 


Leu 


250 


Lys 


Gly 


Glu 


Asp 


Gly 


Artificial Sequence: 


Leu 


Asp 


Lys 


Gly 


Pro 


75 


Val 


Lys 


Gly 


Pro 


Tyr 


155 


Gly 


Pro 


Ser 


Gly 


Pro 


235 


Pro 


Gly 


Glu 


Arg 


Gly 


alo 


Pro 


Leu 


Ile 


Asp 


Asn 


60 


Asn 


Asp 


Gly 


Arg 


Pro 


140 


Gly 


Pro 


Gln 


Gly 


Asp 


220 


Pro 


Gly 


Glu 


Asn 


Gly 
300 


Ala 


Pro 


Ile 


Gln 


val 


45 


Ile 


Ala 


Ala 


Asp 


Asp 


125 


Gly 


Tyr 


Ala 


Gly 


Pro 


205 


Asp 


Gly 


Met 


Pro 


Gly 
285 
Arg 


Pro 


Gly 


Ala 


Thr 


30 


Trp 


Leu 


Glu 


Ser 


Thr 


110 


Gly 


Pro 


Asp 


Gly 


Phe 


190 


Met 


Gly 


Pro 


Lys 


Gly 


270 


Ala 


Pro 


Gly 


Phe 


Ala 


15 


Gly 


Lys 


Cys 


Ile 


Pro 


95 


Gly 


Ile 


Pro 


Glu 


Pro 


175 


Gln 


Gly 


Glu 


Gin 


Gly 


255 


Pro 


Pro 


Gly 


Ala 


Pro 


4362 


Synthetic 


Thr 


Ser 


Pro 


Asp 


Pro 


80 


Val 


Pro 


Pro 


Pro 


Ala 


Gly 


240 


His 


Ala 


Gly 


Pro 


Ala 
320 


Gly 
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-continued 
325 330 335 


Ala Ala Gly Ala Lvs Gly Glu Thr Gly Pro Gln Gly Ala Arg Gly Ser 
340 345 350 


Glu Gly Pro Gln Gly Ala Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala 
355 360 365 


Gly Ala Ala Gly Pro Ala Gly Asn Pro Gly Ala Asp Gly Gln Pro Gly 
370 375 380 


Ala Lys Gly Ala Thr Gly Ala Pro Gly Ile Ala Gly Ala Pro Gly Phe 
385 390 395 400 


Pro Gly Ala Arg Gly Ala Pro Gly Pro Gln Gly Pro Ser Gly Ala Pro 
405 410 415 


Gly Pro Lvs Gly Asn Ser Gly Glu Pro Gly Ala Pro Gly Asn Lys Gly 
420 425 430 


Asp Thr Gly Ala Lvs Gly Glu Pro Gly Pro Ala Gly Val Gln Gly Pro 
435 440 445 


Pro Gly Pro Ala Gly Glu Glu Gly Lys Arg Gly Ala Arg Gly Glu Pro 
450 455 460 


Gly Pro Ala Gly Leu Pro Gly Pro Ala Gly Glu Arg Gly Ala Pro Gly 
465 470 475 480 


Ser Arg Gly Phe Pro Gly Ala Asp Gly Ile Ala Gly Pro Lys Gly Pro 
485 490 495 


Pro Gly Glu Arg Gly Ser Pro Gly Ala Val Gly Pro Lys Gly Ser Pro 
500 505 510 


Gly Glu Ala Gly Arg Pro Gly Glu Pro Gly Leu Pro Gly Ala Lys Gly 
515 520 525 


Leu Thr Gly Ser Pro Gly Ser Pro Gly Pro Asp Gly Lys Thr Gly Pro 
530 535 540 


Pro Gly Pro Ala Gly Gln Asp Gly Arg Pro Gly Pro Pro Gly Pro Pro 
545 550 555 560 


Gly Ala Arg Gly Gln Ala Gly Val Met Gly Phe Pro Gly Pro Lys Gly 
565 570 575 


Ala Ala Gly Glu Pro Gly Lys Pro Gly Glu Arg Gly Ala Pro Gly Pro 
580 585 590 


Pro Gly Ala Val Gly Ala Ala Gly Lys Asp Gly Glu Ala Gly Ala Gln 
595 600 605 


Gly Pro Pro Gly Pro Thr Gly Pro Ala Gly Glu Arg Gly Glu Gln Gly 
610 615 620 


Pro Ala Gly Ala Pro Gly Phe Gln Gly Leu Pro Gly Pro Ala Gly Pro 
625 630 635 640 


Pro Gly Glu Ala Gly Lys Pro Gly Glu Gln Gly Val Pro Gly Asp Ala 
645 650 655 


Gly Ala Pro Gly Pro Ala Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly 
660 665 670 


Glu Arg Gly Val Gln Gly Pro Pro Gly Pro Gln Gly Pro Arg Gly Ala 
675 680 685 


Asn Gly Ala Pro Gly Asn Asp Gly Ala Lys Gly Asp Ala Gly Ala Pro 
690 695 700 


Gly Ala Pro Gly Asn Gln Gly Pro Pro Gly Leu Gln Gly Met Pro Gly 
705 710 715 720 


Glu Arg Gly Ala Ala Gly Leu Pro Gly Ala Lys Gly Asp Arg Gly Asp 
725 730 735 
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-continued 
Pro Gly Pro Lys Gly Ala Asp Gly Ala Pro Gly Lys Asp Gly Leu Arg 
740 745 750 


Gly Leu Thr Gly Pro Ile Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly 
755 760 765 


Asp Lys Gly Glu Ala Gly Pro Ser Gly Pro Pro Gly Pro Thr Gly Ala 
770 775 780 


Arg Gly Ala Pro Gly Asp Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala 
785 790 795 800 


Gly Phe Ala Gly Pro Pro Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly 
805 810 815 


Glu Thr Gly Asp Ala Gly Ala Lys Gly Asp Ala Gly Pro Pro Gly Pro 
820 825 830 


Ala Gly Pro Thr Gly Ala Pro Gly Pro Ala Gly Ala Val Gly Ala Pro 
835 840 845 


Gly Pro Lys Gly Ala Arg Gly Ser Ala Gly Pro Pro Gly Ala Thr Gly 
850 855 860 


Phe Pro Gly Ala Ala Gly Arg Val Gly Pro Pro Gly Pro Ser Gly Asn 
865 870 875 880 


Ile Gly Leu Pro Gly Pro Pro Gly Pro Ser Gly Lys Glu Gly Gly Lys 
885 890 895 


Gly Pro Arg Gly Glu Thr Gly Pro Ala Gly Arg Pro Gly Glu Pro Gly 
900 905 910 


Pro Ala Gly Pro Pro Gly Pro Pro Gly Glu Lys Gly Ser Pro Gly Ala 
915 920 925 


Asp Gly Pro Ile Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala 
930 935 940 


Gly Gln Arg Gly Val Val Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly 
945 950 955 960 


Phe Pro Gly Leu Pro Gly Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro 
965 970 975 


Ser Gly Ser Pro Gly Glu Arg Gly Pro Pro Gly Pro Met Gly Pro Pro 
980 985 990 


Gly Leu Ala Gly Pro Pro Gly Glu Ala Gly Arg Glu Gly Ala Pro Gly 
995 1000 1005 


Ala Glu Gly Ala Pro Gly Arg Asp Gly Ala Ala Gly Pro Lys Gly 
1010 1015 1020 


Asp Arg Gly Glu Thr Gly Pro Ala Gly Pro Pro Gly Ala Pro Gly 
025 1030 1035 


Ala Pro Gly Ala Pro Gly Pro Val Gly Pro Ala Gly Lys Asn Gly 
040 1045 1050 


Asp Arg Gly Glu Thr Gly Pro Ala Gly Pro Ala Gly Pro Pro Gly 
055 1060 1065 


Pro Ala Giv Ala Arg Gly Pro Ala Gly Pro Gln Gly Pro Arg Gly 
070 075 1080 


Asp Lys Gly Glu Thr Gly Glu Gln Gly Asp Arg Gly Met Lys Gly 
085 090 1095 


His Arg Gly Phe Ser Gly Leu Gln Gly Pro Pro Gly Pro Pro Gly 
100 105 1110 


Ala Pro Gly Glu Gln Gly Pro Ser Gly Ala Ser Gly Pro Ala Gly 
115 120 1125 


US 2022/0333081 Al Oct. 20, 2022 
21 


-continued 


Pro Arg Gly Pro Pro Gly Ser Ala Gly Ala Ala Gly Lys Asp Gly 
30 35 1140 


Leu Asn Gly Leu Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly 
45 50 1155 


Arg Thr Gly Glu Val Gly Pro Val Gly Pro Pro Gly Pro Pro Gly 
60 65 1170 


Pro Pro Gly Pro Pro Gly Pro Pro Ser Gly Gly Phe Asp Phe Ser 
75 80 1185 


Phe Leu Pro Gln Pro Pro Gln Glu Lys Ala His Asp Gly Gly Arg 
90 95 1200 


Tyr Tyr Arg Ala Asp Asp Ala Asn Val Met Arg Asp Arg Asp Leu 
205 210 1215 


Glu Val Asp Thr Thr Leu Lys Ser Leu Ser Gln Gln Ile Glu Asn 
220 225 1230 


Ile Arg Ser Pro Glu Gly Thr Arg Lys Asn Pro Ala Arg Thr Cys 
235 240 1245 


Arg Asp Leu Lys Met Cys His Gly Asp Trp Lys Ser Gly Glu Tyr 
1250 1255 1260 


Trp Ile Asp Pro Asn Gln Gly Cys Asn Leu Asp Ala Ile Lys Val 
1265 1270 1275 


Tyr Cys Asn Met Glu Thr Gly Glu Thr Cys Val Tyr Pro Thr Gln 
1280 1285 1290 


Ala Thr Ile Ala Gln Lys Asn Trp Tyr Leu Ser Lys Asn Pro Lys 
1295 1300 1305 


Glu Lys Lys His Val Trp Phe Gly Glu Thr Met Ser Asp Gly Phe 
1310 1315 1320 


Gln Phe Glu Tyr Gly Gly Glu Gly Ser Asn Pro Ala Asp Val Ala 
1325 1330 1335 


Ile Gln Leu Thr Phe Leu Arg Leu Met Ser Thr Glu Ala Thr Gln 
1340 1345 1350 


Asn Val Thr Tyr His Cys Lys Asn Ser Val Ala Tyr Met Asp His 
1355 1360 1365 


Asp Thr Gly Asn Leu Lys Lys Ala Leu Leu Leu Gln Gly Ala Asn 
1370 1375 1380 


Glu Ile Glu Ile Arg Ala Glu Gly Asn Ser Arg Phe Thr Tyr Gly 
1385 1390 1395 


Val Thr Glu Asp Gly Cys Thr Ser His Thr Gly Ala Trp Gly Lys 
1400 1405 1410 


Thr Val Ile Glu Tyr Lys Thr Thr Lys Thr Ser Arg Leu Pro Ile 
1415 1420 1425 


Ile Asp Leu Ala Pro Met Asp Val Gly Ala Pro Asp Gln Glu Phe 
1430 1435 1440 


Gly Ile Asp Ile Gly Pro Val Cys Phe Leu 


<210> SEQ ID NO 4 

<211> LENGTH: 4092 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 
polynucleotide 


<400> SEQUENCE: 4 
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atgetgaget 
acaagccaac 
ccgcaaggtg 
ggeccacctg 
ccaagcaagg 
ccgggtgcga 
gagcccggcc 
ggggaagacg 
caaggcgcga 
cacaacggcc 
ggcgctcccg 
aggggacgaa 
cccactggtc 
ggtgctaagg 
agaggggaaa 
gegaatgget 
ggattgcccg 
aggggactag 
gaacccggcg 
gggagtaatg 
cctggaagca 
aacagaggag 
ggcgaacctg 
gegggaaaag 
ccggccggaa 
ggtgagccgg 
ccgggccccg 
gctaagggcg 
gggcccgccg 
cccgggcccg 
cccgccgggc 
ggtgaacctg 


cctggcgaac 


ctgagagggg 


ggagcccccg 


gccggccccg 


ccaagcgggt 


gggeccaagg 


tegtggacac 
acgtgagcga 
agcgaggacc 
gaccccccgg 
cagccgactt 
gcgggcctcc 
aaactggccc 
gtcatccggg 
ggggctttcc 
tggacggaca 
gtgaaaatgg 
tcggtgcgcc 
cggccggccc 


gggaaatagg 


tcggcctgcc 
tacccggcgc 
gSaccccgggg 
taggggagcc 
cegetggtec 
gggagecegg 
gagggttgcc 
caagcggccc 
gtctcatggg 
aagggectgt 
ataggggaga 


gtaagcccog 


aaggcaacaa 


aaacgggacc 


gcgaagccgg 


ccgggccgcg 


cgeccggctc 


gcaatgtggg 


ggggtgtgge 


ataccggcgc 


gcccggccgg 


ccggtgcgcg 


ttgctgggcc 


gaccaaaggg 


aagaatcctg 
ggcaagegec 
ccccggcccc 
ccctcctggg 
tgggectggt 
eggacegect 
gcaagggcca 
caagccgggg 
tggtacgccc 
gaagggacaa 
cacccctgga 
cggccccgcc 
cattggggcc 
acccgcgggg 
cgggtcaagc 
gaaaggagcc 
gattccegge 
cggccccgcc 
accgggaccg 
cagcgctggc 
gggcgcagat 
cgttggtgcc 
accacgcgga 
gggatttcct 
acccggtaac 
cgagaaaggt 
tggcgcgcaa 
cgccggtcct 
caaacccggc 
tggegagcgg 
tegagggect 


ccccgccggc 


cggcgtgcct 


taccggccgg 


cggtgccggc 


aggtatacct 


gcccggtgcc 


tgagactggc 


ctgctgctgg 
ggtagaaaag 
cccggtcgtg 
ctgggaggca 
ccaatgggcc 
ggttttcaag 
aggggaccac 
aggcctggag 
gggttacccg 
cccggtactc 
caacccgggg 
ggcgcaaggg 
gccggcccgc 
aacgtaggcc 
ggccctgtcg 
actgggttgc 
ccccccgggc 
ggegegaagg 
cctggaccgt 
cctcccggac 
ggcagagctg 
aagggcccga 
ttgccggggc 
ggagcggatg 
atcgggttcc 
aatgtgggat 
ggcccgcccg 
cctggattcc 
gaacgtggcc 
ggactacctg 
tcgggcccgc 
gctccgggac 
ggaggcaagg 
gacggagccc 
gacagaggag 
ggtgagaggg 
gegggacaac 


cctacggggg 


22 


-continued 


cegtgacaag ctacctggcg 
ggccgcgtgg ggacaagggc 
atggtgaaga cggacctccc 
attttgccgc tcagtacgat 
tcatgggtcc gagaggtcca 
gcgttccggg agagcceggt 
ccggcccgcc tggtaaggct 
aacggggtgt agctgggccg 
ggtttaaagg cattagaggg 
ccggcacaaa gggcgaaccc 
cgagaggact cccceggcgag 
getccgatgg gagtgccggc 
ceggetttcc gggggcaccc 
cgaccggccc cgctggacct 
ggccccccgg caatcccggc 
ctggcgtggc cggcgcaccc 
cggccggccc atcgggtgct 
gcgagagcgg caataaagga 
ctggggagga agggaaaaga 
ctgccggcct ccgaggagta 
gcgtgatggg tccggctggt 
acggtgatgc cggccgaccc 
agcctggctc acctggcccg 
gccgggttgg tcctatcgga 
cgggccccaa gggtccgacg 
tagctggacc aagaggggca 
gggttacggg caaccaaggg 
aaggacttcc ggggccgagc 
tgcatggcga gttcggcgtg 
gtgagagegg tgcggttgga 
ccggcccgga tggtaacaag 
ccgccggacc cgggggcata 
gegaaaaggg ggctccgggg 
gggggttacc cggcgcgatc 
agggcggccc cgccggtccc 
gcgaacctgg acctgttggc 
ccggcgccaa gggagagcga 


ctataggccc gattggtgcg 


60 


120 


180 


240 


300 


360 


420 


480 


540 


600 


660 


720 


780 


840 


900 


960 


1020 


1080 


1140 


1200 


1260 


1320 


1380 


1440 


1500 


1560 


1620 


1680 


1740 


1800 


1860 


1920 


1980 


2040 


2100 


2160 


2220 


2280 
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-continued 


agtggccccc ctggccctgt gggcgccgcg ggtccggccg ggcctcgggg agatgetggg 2340 


ccacccggaa tgactggttt cccgggtgca gccggccgtg tcggtccacc gggccccgcc 2400 
ggcataactg gcccgcccgg tccgccgggg ccggccggca aggacggacc aagaggtctt 2460 
cgtggtgacg tggggcccgt gggaagaacc ggcgagcaag gcattgcggg tccccctggc 2520 
tttgccggcg aaaaaggccc gtcgggtgaa gcgggggcag ctggaccccc cggaactcct 2580 
ggcccccaag gcatacttgg agcgcccggc atacttggcc ttcccggctc aagaggcgaa 2640 
cgagggctac ctggcatcgc tggggccact ggggaacccg gaccacttgg cgtgagtggc 2700 
cctcccggtg cgagggggcc aagtggtccc gttggttctc ccggccctaa tggagcacct 2760 
ggggaagccg gccgtgacgg taatcctggc aatgatggac cacccggccg tgatggtgca 2820 
cctgggttca agggagaaag aggagccccc ggcaaccccg gtccgtccgg cgccttagga 2880 
gcacccggcc cacacgggca agtggggcca agtgggaagc cggggaatcg gggcgacccc 2940 
ggaccggtag ggcccgtggg tcccgccggc gccttcggtc ctcgagggct agccgggcca 3000 
caaggcccac gcggcgaaaa gggcgagccc ggcgacaagg gccacagagg cttgcccggg 3060 
ttaaaaggtc acaacggact gcaaggcctc cccggtctgg cgggacagca tggtgaccaa 3120 
ggcccgccgg ggaacaacgg ccccgcgggc ccgcgtggcc ctccggggcc ctcgggtccg 3180 


cctgggaagg atggacggaa cggcttgccc ggaccgatag gaccggccgg ggtccgtggc 3240 


agccacggca gccaaggccc cgccggccct cccggcccgc ccggccctcc cggcccacct 3300 
ggcccaaatg ggggaggcta cgaggtgggc ttcgacgccg agtactacag agccgatcag 3360 
ccaagcctga gacccaagga ctacgaggtg gacgccaccc tgaagaccct gaacaatcag 3420 
atcgagaccc tgctgacccc cgagggtagc aaaaagaacc cggctagaac ctgcagagac 3480 
ctgagactga gccaccccga gtggagcagc ggcttctact ggatcgaccc caaccaaggc 3540 
tgcaccgccg acgccatcag agcctactgc gacttcgcca ccggcgagac ctgcatccac 3600 
gctagcctgg aggacatccc caccaagacc tggtacgtga gcaagaaccc caaggacaag 3660 
aagcacatct ggttcggcga gaccatcaac ggcggcacac agttcgagta caacggcgag 3720 
ggcgtgacca ccaaggacat ggccacacag ctggccttca tgagactgct ggccaaccac 3780 
gcttctcaga acatcaccta ccactgcaag aacagcatcg cctacatgga cgaggagacc 3840 
ggcaacctga agaaggccgt gatcctgcaa ggcagcaacg acgtggagct gagagctgag 3900 
gggaattccc gtttcacctt cagcgtgctg gtggacggct gcagcaagaa gaataacaag 3960 
tggggcaaga ccatcatcga gtacagaacc aacaagccaa gcagactgcc catcctggac 4020 
atcgcccccc tggacatcgg cggcgccgac caagagttcg gcctgcacat cggccccgtg 4080 


tgcttcaagt ga 4092 


<210> SEQ ID NO 5 

<211> LENGTH: 1985 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 
polypeptide 


<400> SEQUENCE: 5 


Met Ser Ser Asp Ser Glu Met Ala Val Phe Gly Glu Ala Ala Pro Tyr 
1 5 10 L5 
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Leu 


Asp 


Lys 


Leu 


65 


Asn 


Leu 


Trp 


Tyr 


Gly 


145 


Asn 


Ile 


Ile 


Pro 


Ala 


225 


Asp 


Thr 


Ser 


Tyr 


Ile 


305 


Thr 


Ala 


Lys 


Lys 


Lys 
385 


Leu 


Arg 


Ser 


Gly 


50 


Asp 


Pro 


His 


Met 


Lys 


130 


Lys 


Ala 


Thr 


Gln 


Gln 


210 


Asn 


Asn 


Gly 


Arg 


Gln 


290 


Ser 


Val 


Ile 


Leu 


Gln 


370 


Ala 


Cvs 


Lvs 


Lvs 


35 


Met 


Asp 


Pro 


Glu 


Ile 


115 


Trp 


Lys 


Tyr 


Gly 


Tyr 


195 


Thr 


Pro 


Ser 


Lys 


Val 


275 


Ile 


Thr 


Ala 


Asp 


Thr 
355 
Arg 


Gly 


Cys 


Pro 


20 


Lys 


Ile 


Arg 


Lys 


Pro 


100 


Tyr 


Leu 


Arg 


Gln 


Glu 


180 


Phe 


Gly 


Leu 


Ser 


Leu 


260 


Thr 


Met 


Asn 


Ser 


Ile 


340 


Gly 


Glu 


Tyr 


Pro 


Glu 


Ala 


Gln 


Thr 


Phe 


85 


Ala 


Thr 


Pro 


Gln 


Phe 


165 


Ser 


Ala 


Lys 


Leu 


Arg 


245 


Ala 


Phe 


ser 


Pro 


Tle 


325 


Leu 


Ala 


Glu 


Leu 


Arg 
405 


Lys 


Cys 


Asn 


Ile 


70 


Asp 


Val 


Tyr 


val 


Glu 


150 


Met 


Gly 


Thr 


Met 


Glu 


230 


Phe 


ser 


Gln 


Asn 


Phe 


310 


Asp 


Gly 


Val 


Gin 


Met 
390 


Val 


Glu 


Phe 


Arg 


55 


Thr 


Lys 


Leu 


Ser 


Tyr 


135 


Ala 


Leu 


Ala 


Ile 


Gin 


215 


Ala 


Gly 


Ala 


Leu 


Lys 


295 


Asp 


Asp 


Phe 


Met 


Ala 
375 


Gly 


Lys 


Arg 


Ala 


40 


Glu 


Leu 


Ile 


Tyr 


Gly 


120 


Asn 


Pro 


Thr 


Gly 


Ala 


200 


Gly 


Phe 


Lys 


Asp 


Ser 


280 


Lys 


Phe 


Ser 


Ser 


His 


360 


Glu 


Leu 


Val 


Thr 


25 


Val 


Asn 


Asn 


Glu 


Asn 


105 


Leu 


Pro 


Pro 


Asp 


Lys 


185 


Val 


Thr 


Gly 


Phe 


Ile 


265 


Ser 


Pro 


Pro 


Glu 


Ser 


345 


Tyr 


Pro 


Asn 


Gly 


Glu 


Asp 


Asp 


Thr 


Asp 


90 


Leu 


Phe 


Glu 


His 


Arg 


170 


Thr 


Pro 


Leu 


Asn 


Ile 


250 


Glu 


Glu 


Glu 


Phe 


Glu 


330 


Glu 


Gly 


Asp 


Ser 


Asn 
410 


Ala 


Asp 


Lys 


Ala 


75 


Met 


Lys 


Cvs 


Val 


Ile 


155 


Asp 


Val 


Gly 


Glu 


Ala 


235 


Arg 


Thr 


Arg 


Leu 


Ile 


315 


Leu 


Glu 


Asn 


Gly 


Ala 


395 


Glu 


24 


-continued 


Gln Asn 


Lvs Glu 
45 


Val Thr 
60 


Gln Val 


Ala Met 


Glu Arg 


Val Thr 
125 


Val Met 
140 


Phe Ser 


Asn Gln 


Asn Thr 


Asp Lys 


205 


Asp Gln 
220 


Lys Thr 


Ile His 


Tyr Leu 


Ser Tyr 
285 


Ile Asp 
300 


Ser Gln 


Leu Ala 


Lys Val 
Met Lys 
365 


Thr Glu 
380 


Glu Met 


Tyr Val 


Arg 


30 


Val 


Val 


Phe 


Met 


Tyr 


110 


Val 


Ala 


Ile 


Ser 


Lys 


190 


Lys 


Ile 


Val 


Phe 


Leu 


270 


His 


Leu 


Gly 


Thr 


Gly 


350 


Phe 


Val 


Leu 


Thr 


Pro 


Tyr 


Lys 


Pro 


Thr 


95 


Ala 


Asn 


Tyr 


Ser 


Ile 


175 


Arg 


Lys 


Ile 


Arg 


Gly 


255 


Glu 


Ile 


Leu 


Glu 


Asp 


335 


Ile 


Lys 


Ala 


Lys 


Lys 
415 


Phe 


Val 


Thr 


Met 


80 


His 


Ala 


Pro 


Arg 


Asp 


160 


Leu 


Val 


Glu 


Gln 


Asn 


240 


Ala 


Lys 


Phe 


Leu 


Val 


320 


Asn 


Tyr 


Gin 


Asp 


Gly 


400 


Gly 
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-continued 


Gln Asn Val Gln Gln Val Thr Asn Ser Val Gly Ala Leu Ala Lys Ala 
420 425 430 


Val Tyr Glu Lys Met Phe Leu Trp Met Val Ala Arg Ile Asn Gln Gln 
435 440 445 


Leu Asp Thr Lvs Gln Ala Arg Gln Tvr Phe Ile Gly Val Leu Asp Ile 
450 455 460 


Ala Gly Phe Glu Ile Phe Asp Phe Asn Ser Leu Glu Gln Leu Cys Ile 
465 470 475 480 


Asn Phe Thr Asn Glu Lys Leu Gln Gln Phe Phe Asn His His Met Phe 
485 490 495 


Val Leu Glu Gln Glu Glu Tvr Lys Lys Glu Gly Ile Glu Trp Glu Phe 
500 505 510 


Ile Asp Phe Gly Met Asp Leu Ala Ala Cys Ile Glu Leu Ile Glu Lys 
515 520 525 


Pro Met Gly Ile Phe Ser Ile Leu Glu Glu Glu Cys Met Phe Pro Lys 
530 535 540 


Ala Thr Asp Thr Ser Phe Lys Asn Lys Leu Tyr Asp Gln His Leu Gly 
545 550 555 560 


Lys Ser Ala Asn Phe Gln Lys Pro Lys Pro Thr Lys Gly Lys Ala Glu 
565 570 575 


Ala His Phe Ser Leu Val His Tyr Ala Gly Thr Val Asp Tyr Asn Ile 
580 585 590 


Val Gly Trp Leu Asp Lys Asn Lys Asp Pro Leu Asn Glu Thr Val Val 
595 600 605 


Gly Leu Tyr Gln Lys Ser Ser Leu Lys Leu Leu Ala Phe Leu Phe Ser 
610 615 620 


Asn Tyr Ala Gly Ala Glu Ala Gly Asp Cys Gly Gly Ser Lys Lys Gly 
625 630 635 640 


Gly Lys Lys Lys Gly Ser Ser Phe Gln Thr Val Ser Ala Val Phe Arg 
645 650 655 


Glu Asn Leu Asn Lys Leu Met Thr Asn Leu Arg Ser Thr His Pro His 
660 665 670 


Phe Val Arg Cys Leu Ile Pro Asn Glu Thr Lys Thr Pro Gly Val Met 
675 680 685 


Asp His Tyr Leu Val Ile His Gln Leu Arg Cys Asn Gly Val Leu Glu 
690 695 700 


Gly Ile Arg Ile Cys Arg Lys Gly Phe Pro Ser Arg Ile Leu Tyr Gly 
705 710 715 720 


Asp Phe Arg Gln Arg Tyr Arg Ile Leu Asn Ala Ser Ala Ile Pro Glu 
725 730 735 


Gly Gln Phe Ile Asp Ser Lys Asn Ala Ser Glu Lys Leu Leu Asn Ser 
740 745 750 


Ile Asn Val Asp Arg Glu Gln Tyr Arg Phe Gly His Thr Lys Val Phe 
755 760 765 


Phe Lys Ala Gly Leu Leu Gly Leu Leu Glu Glu Met Arg Asp Glu Lys 
770 775 780 


Leu Ala Thr Leu Met Thr Arg Thr Gln Ala Val Cys Arg Gly Tyr Leu 
785 790 195 800 


Met Arg Val Glu Phe Lys Lys Met Met Glu Arg Arg Asp Ser Ile Leu 
805 810 815 


Cys Ile Gln Asp Asn Ile Arg Ser Phe Met Asn Val Lys His Trp Pro 


US 2022/0333081 Al Oct. 20, 2022 
26 


-continued 
820 825 830 


Trp Met Asn Leu Phe Phe Lys Ile Gln Pro Leu Leu Lys Ser Ala Glu 
835 840 845 


Ala Glu Lys Glu Met Ala Thr Met Lys Glu Asn Phe Lys Arg Ala Gln 
850 855 860 


Glu Glu Leu Ala Arg Ser Glu Ala Arg Arg Lys Glu Leu Glu Glu Lys 
865 870 875 880 


Met Val Ser Leu Leu Gln Glu Lys Asn Asp Leu Gln Leu Gln Val Gln 
885 890 895 


Ser Glu Thr Glu Asn Leu Met Asp Ala Glu Glu Arg Cys Glu Gly Leu 
900 905 910 


Ile Lys Ser Lys Ile Gln Leu Glu Ala Lys Val Lys Glu Leu Asn Glu 
915 920 925 


Lys Leu Glu Glu Glu Glu Glu Ile Asn Ser Glu Leu Val Ala Lys Lys 
930 935 940 


Arg Asn Leu Glu Asp Lys Cys Phe Ser Leu Lys Lys Asp Ile Asp Asp 
945 950 955 960 


Leu Glu Leu Thr Leu Thr Lys Val Glu Lys Glu Lys His Ala Thr Glu 
965 970 975 


Asn Lys Val Lys Asn Leu Ser Glu Glu Met Thr Ala Leu Glu Glu Asn 
980 985 990 


Ile Ser Lys Leu Thr Lys Glu Lys Lys Ser Leu Gln Glu Ala His Gln 
995 1000 1005 


Gln Thr Leu Asp Gly Leu Gln Val Glu Glu Asp Lys Val Asn Gly 
1010 1015 1020 


Leu Ile Lys Ile Asn Ala Lys Leu Glu Gln Gln Thr Asp Asp Leu 
1025 1030 1035 


Glu Gly Cys Leu Glu Gln Glu Lys Lys Leu Arg Ala Asp Leu Gly 
1040 1045 1050 


Arg Leu Lys Arg Lys Leu Glu Gly Asp Leu Lys Ile Ser Gln Glu 
1055 1060 1065 


Ser Ile Thr Asp Leu Glu Asn Ala Gln Gln Gln Met Glu Glu Lys 
1070 1075 1080 


Leu Lys Lys Lys Glu Phe Glu Ile Ser Gln Leu Gln Thr Lys Ile 
1085 1090 1095 


Asp Gly Glu Gln Val His Ser Leu Gln Leu Gln Lys Lys Ile Lys 
1100 1105 1110 


Glu Leu Gln Ala Arg Thr Glu Glu Leu Glu Glu Asp Ile Glu Ala 
15 1120 1125 


Glu His Thr Leu Arg Ala Lys Leu Glu Arg Gln Arg Ser Asp Leu 
30 1135 1140 


Ala Arg Glu Leu Glu Glu Ile Ser Glu Arg Leu Glu Glu Ala Ser 
45 1150 1155 


Gly Ala Thr Ser Ala Gln Val Glu Leu Asn Lys Lys Arg Glu Ala 
60 1165 1170 


Glu Phe Gln Lys Met Arg Arg Asp Leu Glu Glu Ala Thr Leu Gln 
75 1180 1185 


His Glu Ala Thr Ala Ala Thr Leu Arg Lys Lys His Ala Asp Ser 
90 1195 1200 


Val Ala Glu Leu Gly Glu Gln Ile Asp Asn Leu Gln Arg Val Lys 
205 1210 1215 
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-continued 
Gln Lys Leu Gln Lys Glu Lys Ser Glu Leu Lys Met Glu Met Asp 
220 225 1230 


Asp Met Ala Ser Asn Ile Glu Thr Ile Thr Lvs Ser Lys Ser Asn 
235 240 1245 


Met Glu Arg Met Cys Arg Thr Met Glu Asp Gln Phe Asn Glu Leu 
250 255 1260 


Lys Ala Lys Asp Asp Gln Gln Thr Gln Leu Ile His Asp Leu Asn 
265 270 1275 


Met Gln Lys Ala Arg Leu Gln Thr Gln Asn Gly Glu Leu Ser His 
280 285 1290 


Gln Val Glu Glu Lys Glu Ser Leu Ile Ser Gln Leu Ser Lys Gly 
295 300 1305 


Lys Gln Ala Leu Thr Gln Gln Leu Glu Glu Leu Lys Arg Gln Leu 
310 315 1320 


Glu Glu Glu Thr Lys Ala Lys Asn Ala Leu Ala His Ala Leu Gln 
325 1330 1335 


Ser Ser Arg His Asp Cys Asp Leu Leu Arg Glu Gln Tyr Glu Glu 
1340 1345 1350 


Glu Gln Glu Gly Lys Ala Glu Val Gln Arg Ala Leu Ser Lys Ala 
1355 1360 1365 


Asn Ser Glu Val Ala Gln Trp Arg Thr Lys Tyr Glu Thr Asp Ala 
1370 1375 1380 


Ile Gln Arg Thr Glu Glu Leu Glu Glu Ala Lys Lys Lys Leu Ala 
1385 1390 1395 


Gln Arg Leu Gln Glu Ala Glu Glu Asn Thr Glu Ala Val Asn Ser 
1400 1405 1410 


Lys Cys Ala Ser Leu Glu Lys Thr Lys Gln Arg Leu Gln Gly Glu 
1415 1420 1425 


Val Asp Asp Leu Met Leu Asp Leu Glu Arg Ala Asn Thr Thr Cys 
1430 1435 1440 


Ala Thr Leu Gly Lys Lys Gln Arg Asn Phe Asp Lys Ile Leu Ala 
1445 1450 1455 


Asp Trp Lys Gln Lys Leu Asp Glu Ser His Ala Glu Leu Glu Ala 
1460 1465 1470 


Ala Gln Lys Glu Ser Arg Ser Leu Ser Thr Glu Ile Phe Lys Met 
1475 1480 1485 


Lys Asn Ala Tyr Glu Glu Val Val Asp Gln Leu Glu Thr Leu Arg 
490 1495 1500 


Arg Glu Asn Lys Asn Leu Gln Glu Glu Ile His Asp Val Thr Glu 
505 1510 1515 


Gln Ile Ala Glu Thr Gly Lys Asn Leu Gln Glu Met Glu Lys Thr 
520 1525 1530 


Lys Lys Gln Val Glu Gln Glu Lys Ser Asp Leu Gln Val Ala Leu 
535 540 1545 


Glu Glu Val Glu Gly Ser Leu Glu His Glu Glu Ser Lys Ile Leu 
550 555 1560 


Arg Val Gln Leu Glu Leu Ser Gln Leu Lys Ser Glu Leu Asp Arg 
565 570 1575 


Lys Val Ala Glu Lys Asp Glu Glu Met Glu Gln Leu Lys Arg Asn 
580 585 1590 
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-continued 


Cys Gln Arg Ala Ala Glu Ala Thr Gln Ser Leu Leu Glu Ala Glu 
595 600 1605 


Ser Arg Ser Arg Lys Asp Ala Leu Arg Leu Lys Lys Lys Met Glu 
610 615 1620 


Gly Asp Leu Asn Glu Met Glu Val His Leu Gly His Ala Asn Arg 
625 630 1635 


Gln Val Ala Glu Thr Gln Lys His Leu Arg Thr Val Gln Gly Gln 
640 645 1650 


Leu Lys Gly Ser Gln Leu His Leu Gly Asp Ala Leu Arg Ser Asp 
655 660 1665 


Glu Ala Leu Lys Glu Arg Leu Ala Ile Val Gly Arg Arg Asn Gly 
670 675 1680 


Leu Leu Val Glu Glu Leu Glu Glu Thr Lys Val Ala Leu Glu Gln 
685 690 1695 


Ala Glu Arg Ser Arg Arg Leu Ser Glu Gln Glu Leu Gln Asp Ala 
700 705 1710 


Ser Asp Arg Val Gln Leu Leu Leu Ser Arg Cys Asn Thr Ser Leu 
1715 1720 1725 


Ile Asn Ile Lys Lys Lys Leu Glu Thr Asn Ile Ala Gin Cys Gln 
1730 1735 1740 


Ala Glu Val Glu Asn Ser Val Gln Glu Ser Arg Asn Ala Glu Glu 
1745 1750 1755 


Lys Ala Lys Lys Ala Ile Thr Asp Ala Ala Met Met Ala Glu Glu 
1760 1765 1770 


Leu Lys Lys Glu Gln Asp Thr Ser Ala His Leu Glu Arg Met Lys 
1775 1780 1785 


Lys Asn Leu Glu Gln Thr Val Lys Asp Met Gln His Arg Leu Asp 
1790 1795 1800 


Glu Ala Glu Gln Leu Ala Leu Lys Gly Asp Lys Lys Gln Ile Gln 
1805 1810 1815 


Lys Leu Glu Asn Arg Val Arg Glu Leu Glu Ser Glu Leu Asp Thr 
1820 1825 1830 


Glu Gln Lys Arg Gly Ala Glu Ala Leu Lys Gly Ala His Lys Tyr 
1835 1840 1845 


Glu Arg Lys Val Lys Glu Met Ala Tyr Gln Ala Glu Glu Asp His 
1850 1855 1860 


Lys Asn Ser Leu Arg Leu Gln Asp Leu Val Asp Lys Leu Gln Ala 
1865 1870 1875 


Lys Val Asn Ala Tyr Lys Arg Gln Ala Glu Glu Ala Glu Glu Gln 
880 1885 1890 


Ala Asn Thr Gln Leu Ser Arg Cys Arg Arg Val Gln His Glu Leu 
895 1900 1905 


Glu Glu Ala Glu Glu Arg Val Asp Ile Ala Glu Ser Gln Val Asn 
910 1915 1920 


Lys Leu Arg Ala Lys Ser Arg Asp Val Gly Ser Gln Val Arg Ala 
925 1930 1935 


Asp Thr Trp Ala Trp Leu Leu Leu Lys Glu Ser Arg Asp Pro Gly 
940 1945 1950 


Ala Gln Arg Ser His Arg Glu Ser Leu Glu Ala Ala His Trp Ser 
955 1960 1965 


Leu Leu Leu Val Met Pro Ser Gln Met Pro Pro Thr Gin Gln Arg 
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-continued 
1970 1975 1980 
Leu Leu 
1985 


<210> SEQ ID NO 6 

<211> LENGTH: 5958 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 
polynucleotide 


<400> SEQUENCE: 6 


atgagcagcg acagcgagat ggccgtgttc ggcgaggccg ccccctacct gagaaagccc 60 
gaaaaagaaa gaacggaggc tcagaacaga cccttcgaca gcaaaaaggc ctgcttcgcc 120 
gtggacgaca aagaggtgta cgtgaagggc atgattcaga acagagagaa cgacaaggtg 180 
accgtgaaaa cccttgacga tcgtaccatc accctgaaca ccgcccaagt gttccccatg 240 
aaccccccca agttcgataa gatcgaagac atggccatga tgacccacct gcacgagccc 300 
gccgtgctgt acaacctgaa ggaaagatac gctgcctgga tgatctacac ctacagcggc 360 
ctgttctgcg tgaccgtgaa cccctacaag tggctgcccg tgtacaaccc cgaggtggtg 420 
atggcctaca gaggcaagaa gagacaagag gccccccccc acatctttag cattagcgac 480 
aacgcctatc agttcatgct gaccgacaga gacaatcaga gcatcctgat caccggcgag 540 
agcggcgccg gcaagaccgt gaacaccaag agagtgattc agtacttcgc caccatcgcc 600 
gtgcccggcg acaaaaaaaa agaaccgcaa acgggcaaga tgcaaggcac cctggaggat 660 
cagatcatcc aagccaaccc cctactggag gccttcggca acgccaagac cgtgagaaac 720 
gacaacagca gcagattcgg caagttcatc agaatccact tcggcgccac cggcaagctg 780 
gctagcgccg acatcgagac ctacctgctg gagaagagca gagtgacctt ccaactaagc 840 
tccgagagaa gctaccacat cttctatcag atcatgagca acaagaagcc cgaactgatc 900 
gatttactgc tgatcagcac caaccccttc gacttcccct tcatcagcca aggcgaagta 960 


accgtggcta gcatcgacga cagcgaggag ctgctggcca ccgacaacgc catcgacatc 1020 
ctgggcttca gcagcgagga gaaggtgggc atctacaagc tgaccggcgc cgtgatgcac 1080 
tacggcaaca tgaagttcaa gcaaaaacag agagaggagc aagctgagcc tgatggcacc 1140 
gaggtggccg acaaggccgg ctacctgatg ggtctgaaca gcgccgagat gctgaagggc 1200 
ctgtgctgcc caagagtgaa ggtgggcaac gagtacgtga ccaagggaca gaacgtgcag 1260 
caagtgacca acagcgtggg cgccctggcc aaggccgtgt acgagaagat gttcctgtgg 1320 
atggtggcta gaatcaatca gcaactggac accaagcaag ctagacagta cttcatcggc 1380 
gtgctggata tagccggctt cgagatcttc gacttcaact cactggagca gctttgcatc 1440 
aacttcacca atgagaagtt gcagcaattc ttcaaccacc acatgttcgt gctggaacaa 1500 
gaggaatata agaaggaggg catcgagtgg gagttcatcg acttcggcat ggacctggcc 1560 
gcctgcatcg agctgatcga aaaacccatg ggcatcttct caattctgga agaagagtgt 1620 
atgttcccca aggccaccga cacaagcttc aagaacaagc tgtacgatca gcacctgggc 1680 
aagagcgcca actttcagaa gcccaagccc accaagggca aggccgaagc ccacttcagc 1740 


ctggtgcact acgccggcac cgtggactac aacatcgtgg gctggctgga caagaacaag 1800 


US 2022/0333081 Al Oct. 20, 2022 
30 


-continued 
gaccccctga acgagaccgt ggtgggcctg tatcagaaga gcagcctgaa gctgctggcc 1860 
ttcctgttca gcaactacgc cggegeggaa gcgggggatt geggeggeag caagaaggge 1920 
ggcaagaaga agggcagcag ctttcagacc gtgagcgccg tgttcagaga gaacttaaat 1980 
aagctgatga ccaacctgag aagcacccac ccccacttcg tgagatgcct gatccccaac 2040 
gagaccaaga cccccggcgt gatggaccac tacctggtga tccatcagct gagatgcaac 2100 
ggtgttctgg agggtatcag aatctgcaga aagggcttcc caagcagaat cctgtacggc 2160 
gacttcagac agagatacag aatcctgaac gctagcgcca tccccgaggg acagttcatt 2220 
gatagcaaga acgcaagcga gaagctgctg aacagcatca acgtggacag agagcagtat 2280 
agattcggcc acaccaaggt gttcttcaag gccggcctgc tgggcctgct agaggagatg 2340 
agagacgaga aattggccac gctgatgaca agaacccaag ccgtgtgcag aggttatctg 2400 
atgcgcgtgg agttcaagaa aatgatggag agaagagaca gcatcctgtg catccaagac 2460 
aacatcagaa gcttcatgaa cgtgaagcac tggccctgga tgaacctgtt cttcaagatt 2520 
cagcccctgc tgaagtcagc ggaggcggaa aaggaaatgg ccaccatgaa ggagaacttc 2580 
aagagagccc aagaggagct ggctagaagc gaggctagaa gaaaggagct cgaggagaag 2640 
atggtgagcc tgctgcaaga gaagaacgac ctgcagctgc aagtccaaag cgagaccgag 2700 
aacctgatgg atgcagagga gcgatgtgag ggtctgatta agtccaaaat tcagctggaa 2760 
gccaaagtga aggaactgaa tgagaagcta gaagaggagg aagaaataaa cagcgagctg 2820 
gtggccaaga agcgcaacct ggaggacaag tgcttcagcc tgaagaagga catcgacgat 2880 
cttgagctga ccctgaccaa ggtggagaag gagaagcacg ccaccgagaa caaggtgaag 2940 
aacctgagcg aggagatgac cgccctggag gaaaacatta gcaagctgac caaagagaag 3000 
aagtcgctgc aagaggctca ccaacagact ctggatggcc tgcaagtgga ggaggataaa 3060 
gtaaacggct tgatcaagat taacgcaaag ttggaacaac agaccgacga tctggagggc 3120 
tgcttagaac aagaaaagaa gctgagagcc gacctgggca gactgaagag aaagctggag 3180 
ggcgacctga agataagcca agagagcatc accgacctgg agaacgctca gcagcagatg 3240 
gaggagaagc tgaagaagaa agaatttgag atcagccaac tgcagaccaa gatcgacggc 3300 
gagcaagtgc acagccttca gttacaaaag aagatcaagg agctacaagc tagaaccgag 3360 
gagcttgagg aagacatcga agccgaacac accctgagag cgaaactgga gagacagaga 3420 
agcgacctgg ctagagagct ggaggaaatc agcgagagac tcgaggaggc tagcggcgcc 3480 
acttccgccc aagtggagct gaacaagaag agagaggccg agtttcagaa gatgagaaga 3540 
gacctggaag aggccactct gcagcacgag gccaccgccg ccaccctgag aaagaagcac 3600 
gccgacagcg tggccgagct gggcgagcag atcgacaacc tgcagagagt gaagcaaaaa 3660 
ttacaaaaag aaaagagtga gctgaaaatg gagatggacg acatggctag caacatcgag 3720 
accatcacca agagcaagag caacatggag agaatgtgca gaaccatgga ggatcagttc 3780 
aacgagctga aggccaagga cgatcagcag acacagctga tccacgacct gaacatgcag 3840 
aaggctagac tgcagacaca gaacggcgag ctgagccacc aagtggagga gaaggaaagc 3900 
ctgatttctc agctgagcaa gggcaagcaa gccctgacac agcaattaga ggaactgaag 3960 
aggcagctcg aggaggagac caaggccaaa aacgccctgg cccacgccct gcagagcagc 4020 


agacacgact gcgacctgct gcgggagcag tacgaggaag agcaagaggg taaggctgag 4080 


US 2022/0333081 Al Oct. 20, 2022 
31 


-continued 
gtgcagcgag ccctgagcaa ggccaacage gaggtggctc agtggagaac caagtacgag 4140 
accgacgcca ttcagagaac tgaagaattg gaggaagcga agaagaaget cgcccaacga 4200 
ttgcaagaag ccgaagagaa caccgaggcc gtgaacagca agtgcgctag cctggagaag 4260 
acgaagcaaa gactacaagg tgaggtggat gatctgatgc tggacctgga gagagccaac 4320 
accacctgcg ccaccctggg caagaagcag agaaacttcg acaaaatcct ggctgactgg 4380 
aagcagaage tggacgagag ccacgccgag ctggaagccg ctcagaagga aagcagatcc 4440 
ttaagcaccg agatcttcaa gatgaagaac gcctacgaag aggtggtcga tcagctggag 4500 
accctgagaa gagagaacaa gaaccttcaa gaggagatcc acgacgtgac cgagcagatc 4560 
gccgagaccg gtaaaaacct acaagagatg gagaagacca agaagcaagt ggaacaagag 4620 
aagtccgatc tgcaagtggc cctggaagaa gtagagggca gccttgaaca tgaggagtcg 4680 
aaaatcctga gagtacagct ggagctgtct cagctgaagt ctgagttgga tagaaaagtc 4740 
gctgaaaagg acgaggagat ggagcagctg aagagaaact gtcagagagc cgccgaggcc 4800 
acacagagcc tgttagaggc cgaatcgaga agcagaaaag acgcactgag actgaagaag 4860 
aagatggaag gcgatttgaa cgagatggag gtgcacctgg gccacgccaa cagacaagtg 4920 
gecgaaacac agaagcacct gagaaccgtg caaggacagc tgaagggctc tcagctgcac 4980 
cttggcgacg ccctaagaag cgatgaagca ctgaaggaac gtttagctat cgtgggcaga 5040 
agaaacggcc tgctggttga ggagcttgaa gagacgaaag tagcccttga gcaagccgaa 5100 
agaagcagaa gactgagcga gcaagagcta caagacgcta gcgaccgtgt acagctgttg 5160 
ctgagccgtt gtaacacaag cctgatcaac atcaagaaga aactggagac aaacatcgct 5220 
cagtgccaag ccgaggtgga gaacagcgtg caagagagca gaaatgccga ggagaaggcc 5280 
aagaaggcca tcaccgacge cgccatgatg gccgaggagc tgaaaaaaga gcaagacacg 5340 
tctgctcatc tggagagaat gaagaagaac ctggagcaga ccgtgaagga catgcagcac 5400 
agactggacg aggccgaaca gcttgccctg aagggagaca aaaaacaaat tcagaagctg 5460 
gagaaccgag tgcgcgaact ggagagcgaa ctggataccg agcagaagag aggcgccgag 5520 
gcactcaaag gggcccacaa gtacgagaga aaagttaagg agatggccta ccaagccgaa 5580 
gaggatcaca agaacagcct gaggctgcaa gacctagtgg acaagctgca agccaaagtg 5640 
aacgcctaca agagacaagc cgaggaagca gaagaacaag ccaacacaca gctgtctagg 5700 
tgcagaagag tgcagcacga gctggaggag gctgaagaga gagtcgacat agcggagagc 5760 
caagtgaata aattaagagc gaagagccgg gacgtgggca gccaagtgag agccgacacc 5820 
tgggcctggc tgctgctgaa agagagcaga gaccccggcg ctcagagaag ccacagagag 5880 
agcttagagg cagcgcactg gagcctgctg ctggtgatgc catctcagat gccccccaca 5940 


cagcagcgcc tgctttga 5958 


«210» SEQ ID NO 7 

«211» LENGTH: 1453 

«212» TYPE: PRT 

«213» ORGANISM: Artificial Sequence 

«220» FEATURE: 

«223» OTHER INFORMATION: Description of Artificial Sequence: Synthetic 
polypeptide 


«400» SEQUENCE: 7 


Met Phe Ser Phe Val Asp Ser Arg Leu Leu Leu Leu Ile Ala Ala Thr 
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-continued 
1 5 10 15 


Val Leu Leu Thr Arg Gly Gln Gly Glu Glu Asp Ile Gln Thr Gly Ser 
20 25 30 


Cys Val Gln Asp Gly Leu Thr Tyr Asn Asp Lys Asp Val Trp Lys Pro 
35 40 45 


Glu Pro Cys Gln Ile Cys Val Cys Asp Ser Gly Asn Ile Leu Cys Asp 
50 55 60 


Glu Val Ile Cys Glu Asp Thr Ser Asp Cys Pro Asn Ala Glu Ile Pro 
65 70 75 80 


Phe Gly Glu Cvs Cvs Pro Ile Cvs Pro Asp Val Asp Ala Ser Pro Val 
85 90 95 


Tvr Pro Glu Ser Ala Gly Val Glu Gly Pro Lvs Gly Asp Thr Gly Pro 
100 105 110 


Arg Glv Asp Arg Gly Leu Pro Gly Pro Pro Gly Arg Asp Gly Ile Pro 
115 120 125 


Gly Gln Pro Gly Leu Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly 
130 135 140 


Leu Gly Gly Asn Phe Ala Pro Gln Met Ser Tyr Gly Tyr Asp Glu Lys 
145 150 155 160 


Ser Ala Gly Val Ala Val Pro Gly Pro Met Gly Pro Ala Gly Pro Arg 
165 170 175 


Gly Leu Pro Gly Pro Pro Gly Ala Pro Gly Pro Gln Gly Phe Gln Gly 
180 185 190 


Pro Pro Gly Glu Pro Gly Glu Pro Gly Ala Ser Gly Pro Met Gly Pro 
195 200 205 


Arg Gly Pro Ala Gly Pro Pro Gly Lys Asn Gly Asp Asp Gly Glu Ala 
210 215 220 


Gly Lys Pro Gly Arg Pro Gly Glu Arg Gly Pro Pro Gly Pro Gln Gly 
225 230 235 240 


Ala Arg Gly Leu Pro Gly Thr Ala Gly Leu Pro Gly Met Lys Gly His 
245 250 255 


Arg Gly Phe Ser Gly Leu Asp Gly Ala Lys Gly Glu Pro Gly Pro Ala 
260 265 270 


Gly Pro Lys Gly Glu Pro Gly Ser Pro Gly Glu Asn Gly Ala Pro Gly 
275 280 285 


Gln Met Gly Pro Arg Gly Leu Pro Gly Glu Arg Gly Arg Pro Gly Pro 
290 295 300 


Ser Gly Pro Ala Gly Ala Arg Gly Asn Asp Gly Ala Pro Gly Ala Ala 
305 310 315 320 


Gly Pro Pro Gly Pro Thr Gly Pro Ala Gly Pro Pro Gly Phe Pro Gly 
325 330 335 


Ala Ala Gly Ala Lys Gly Glu Thr Gly Pro Gln Gly Ala Arg Gly Ser 
340 345 350 


Glu Gly Pro Gln Gly Ala Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala 
355 360 365 


Gly Ala Ala Gly Pro Ala Gly Asn Pro Gly Ala Asp Gly Gln Pro Gly 
370 375 380 


Ala Lys Gly Ala Thr Gly Ala Pro Gly Ile Ala Gly Ala Pro Gly Phe 
385 390 395 400 


Pro Gly Ala Arg Gly Ala Pro Gly Pro Gln Gly Pro Ser Gly Ala Pro 
405 410 415 
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-continued 
Gly Pro Lys Gly Asn Ser Gly Glu Pro Gly Ala Pro Gly Asn Lys Gly 
420 425 430 


Asp Thr Gly Ala Lys Gly Glu Pro Gly Pro Ala Gly Val Gln Gly Pro 
435 440 445 


Pro Gly Pro Ala Gly Glu Glu Gly Lys Arg Gly Ala Arg Gly Glu Pro 
450 455 460 


Gly Pro Ala Gly Leu Pro Gly Pro Ala Gly Glu Arg Gly Ala Pro Gly 
465 470 475 480 


Ser Arg Gly Phe Pro Gly Ala Asp Gly Ile Ala Gly Pro Lys Gly Pro 
485 490 495 


Pro Gly Glu Arg Gly Ser Pro Gly Ala Val Gly Pro Lys Gly Ser Pro 
500 505 510 


Gly Glu Ala Gly Arg Pro Gly Glu Pro Gly Leu Pro Gly Ala Lys Gly 
515 520 525 


Leu Thr Gly Ser Pro Gly Ser Pro Gly Pro Asp Gly Lys Thr Gly Pro 
530 535 540 


Pro Gly Pro Ala Gly Gln Asp Gly Arg Pro Gly Pro Pro Gly Pro Pro 
545 550 555 560 


Gly Ala Arg Gly Gln Ala Gly Val Met Gly Phe Pro Gly Pro Lys Gly 
565 570 575 


Ala Ala Gly Glu Pro Gly Lys Pro Gly Glu Arg Gly Ala Pro Gly Pro 
580 585 590 


Pro Gly Ala Val Gly Ala Ala Gly Lys Asp Gly Glu Ala Gly Ala Gln 
595 600 605 


Gly Pro Pro Gly Pro Thr Gly Pro Ala Gly Glu Arg Gly Glu Gln Gly 
610 615 620 


Pro Ala Gly Ala Pro Gly Phe Gln Gly Leu Pro Gly Pro Ala Gly Pro 
625 630 635 640 


Pro Gly Glu Ala Gly Lys Pro Gly Glu Gln Gly Val Pro Gly Asp Ala 
645 650 655 


Gly Ala Pro Gly Pro Ala Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly 
660 665 670 


Glu Arg Gly Val Gln Gly Pro Pro Gly Pro Gln Gly Pro Arg Gly Ala 
675 680 685 


Asn Gly Ala Pro Gly Asn Asp Gly Ala Lys Gly Asp Ala Gly Ala Pro 
690 695 700 


Gly Ala Pro Gly Asn Gln Gly Pro Pro Gly Leu Gln Gly Met Pro Gly 
705 710 7:15 720 


Glu Arg Gly Ala Ala Gly Leu Pro Gly Ala Lys Gly Asp Arg Gly Asp 
725 730 735 


Pro Gly Pro Lys Gly Ala Asp Gly Ala Pro Gly Lys Asp Gly Leu Arg 
740 745 750 


Gly Leu Thr Gly Pro Ile Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly 
755 760 765 


Asp Lys Gly Glu Ala Gly Pro Ser Gly Pro Pro Gly Pro Thr Gly Ala 
770 775 780 


Arg Gly Ala Pro Gly Asp Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala 
785 790 795 800 


Gly Phe Ala Gly Pro Pro Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly 
805 810 815 
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-continued 


Glu Thr Gly Asp Ala Gly Ala Lys Gly Asp Ala Gly Pro Pro Gly Pro 
820 825 830 


Ala Gly Pro Thr Gly Ala Pro Gly Pro Ala Gly Ala Val Gly Ala Pro 
835 840 845 


Gly Pro Lys Gly Ala Arg Gly Ser Ala Gly Pro Pro Gly Ala Thr Gly 
850 855 860 


Phe Pro Gly Ala Ala Gly Arg Val Gly Pro Pro Gly Pro Ser Gly Asn 
865 870 875 880 


Ile Gly Leu Pro Gly Pro Pro Gly Pro Ser Gly Lys Glu Gly Gly Lys 
885 890 895 


Gly Pro Arg Gly Glu Thr Gly Pro Ala Gly Arg Pro Gly Glu Pro Gly 
900 905 910 


Pro Ala Gly Pro Pro Gly Pro Pro Gly Glu Lys Gly Ser Pro Gly Ala 
915 920 925 


Asp Gly Pro Ile Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala 
930 935 940 


Gly Gln Arg Gly Val Val Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly 
945 950 955 960 


Phe Pro Gly Leu Pro Gly Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro 
965 970 975 


Ser Gly Ser Pro Gly Glu Arg Gly Pro Pro Gly Pro Met Gly Pro Pro 
980 985 990 


Gly Leu Ala Gly Pro Pro Gly Glu Ala Gly Arg Glu Gly Ala Pro Gly 
995 1000 1005 


Ala Glu Gly Ala Pro Gly Arg Asp Gly Ala Ala Gly Pro Lys Gly 
1010 1015 1020 


Asp Arg Gly Glu Thr Gly Pro Ala Gly Pro Pro Gly Ala Pro Gly 
1025 1030 1035 


Ala Pro Gly Ala Pro Gly Pro Val Gly Pro Ala Gly Lys Asn Gly 
1040 1045 1050 


Asp Arg Gly Glu Thr Gly Pro Ala Gly Pro Ala Gly Pro Pro Gly 
1055 1060 1065 


Pro Ala Giv Ala Arg Gly Pro Ala Gly Pro Gln Gly Pro Arg Gly 
1070 1075 1080 


Asp Lys Gly Glu Thr Gly Glu Gln Gly Asp Arg Gly Met Lys Gly 
1085 1090 1095 


His Arg Gly Phe Ser Gly Leu Gln Gly Pro Pro Gly Pro Pro Gly 
1100 1105 1110 


Ala Pro Gly Glu Gln Gly Pro Ser Gly Ala Ser Gly Pro Ala Gly 
15 1120 1125 


Pro Arg Gly Pro Pro Gly Ser Ala Gly Ala Ala Gly Lys Asp Gly 
30 1135 1140 


Leu Asn Gly Leu Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly 
45 1150 1155 


Arg Thr Gly Glu Val Gly Pro Val Gly Pro Pro Gly Pro Pro Gly 
60 1165 1170 


Pro Pro Gly Pro Pro Gly Pro Pro Ser Gly Gly Phe Asp Phe Ser 
75 1180 1185 


Phe Leu Pro Gln Pro Pro Gln Glu Lys Ala His Asp Gly Gly Arg 
90 1195 1200 


Tyr Tyr Arg Ala Asp Asp Ala Asn Val Met Arg Asp Arg Asp Leu 
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-continued 
205 210 1215 


Glu Val Asp Thr Thr Leu Lys Ser Leu Ser Gln Gln Ile Glu Asn 
220 225 1230 


Ile Arg Ser Pro Glu Gly Thr Arg Lvs Asn Pro Ala Arg Thr Cvs 
235 240 1245 


Arg Asp Leu Lys Met Cys His Gly Asp Trp Lys Ser Gly Glu Tvr 
250 255 1260 


Trp Ile Asp Pro Asn Gln Gly Cys Asn Leu Asp Ala Ile Lys Val 
265 270 1275 


Tyr Cys Asn Met Glu Thr Gly Glu Thr Cys Val Tyr Pro Thr Gln 
280 285 1290 


Ala Thr Ile Ala Gln Lys Asn Trp Tyr Leu Ser Lys Asn Pro Lys 
295 300 1305 


Glu Lys Lys His Val Trp Phe Gly Glu Thr Met Ser Asp Gly Phe 
310 315 1320 


Gln Phe Glu Tyr Gly Gly Glu Gly Ser Asn Pro Ala Asp Val Ala 
1325 1330 1335 


Ile Gln Leu Thr Phe Leu Arg Leu Met Ser Thr Glu Ala Thr Gln 
1340 1345 1350 


Asn Val Thr Tyr His Cys Lys Asn Ser Val Ala Tyr Met Asp His 
1355 1360 1365 


Asp Thr Gly Asn Leu Lys Lys Ala Leu Leu Leu Gln Gly Ala Asn 
1370 1375 1380 


Glu Ile Glu Ile Arg Ala Glu Gly Asn Ser Arg Phe Thr Tyr Gly 
1385 1390 1395 


Val Thr Glu Asp Gly Cys Thr Ser His Thr Gly Ala Trp Gly Lys 
1400 1405 1410 


Thr Val Ile Glu Tyr Lys Thr Thr Lys Thr Ser Arg Leu Pro Ile 
1415 1420 1425 


Ile Asp Leu Ala Pro Met Asp Val Gly Ala Pro Asp Gln Glu Phe 
1430 1435 1440 


Gly Ile Asp Ile Gly Pro Val Cys Phe Leu 
1445 1450 


<210> SEQ ID NO 8 

<211> LENGTH: 1453 

<212> TYPE: PRT 

<213> ORGANISM: Gallus gallus 


<400> SEQUENCE: 8 


Met Phe Ser Phe Val Asp Ser Arg Leu Leu Leu Leu Ile Ala Ala Thr 
1 5 10 15 


Val Leu Leu Thr Arg Gly Gln Gly Glu Glu Asp Ile Gln Thr Gly Ser 
20 25 30 


Cys Val Gln Asp Gly Leu Thr Tyr Asn Asp Lys Asp Val Trp Lys Pro 
35 40 45 


Glu Pro Cys Gln Ile Cys Val Cys Asp Ser Gly Asn Ile Leu Cys Asp 
50 55 60 


Glu Val Ile Cys Glu Asp Thr Ser Asp Cys Pro Asn Ala Glu Ile Pro 
65 70 45 80 


Phe Gly Glu Cys Cys Pro Ile Cys Pro Asp Val Asp Ala Ser Pro Val 
85 90 95 
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-continued 


Tyr Pro Glu Ser Ala Gly Val Glu Gly Pro Lys Gly Asp Thr Gly Pro 
100 105 110 


Arg Gly Asp Arg Gly Leu Pro Gly Pro Pro Gly Arg Asp Gly Ile Pro 
115 120 125 


Gly Gln Pro Gly Leu Pro Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly 
130 135 140 


Leu Gly Gly Asn Phe Ala Pro Gln Met Ser Tyr Gly Tyr Asp Glu Lys 
145 150 155 160 


Ser Ala Gly Val Ala Val Pro Gly Pro Met Gly Pro Ala Gly Pro Arg 
165 170 175 


Gly Leu Pro Gly Pro Pro Gly Ala Pro Gly Pro Gln Gly Phe Gln Gly 
180 185 190 


Pro Pro Gly Glu Pro Gly Glu Pro Gly Ala Ser Gly Pro Met Gly Pro 
195 200 205 


Arg Gly Pro Ala Gly Pro Pro Gly Lys Asn Gly Asp Asp Gly Glu Ala 
210 215 220 


Gly Lys Pro Gly Arg Pro Gly Glu Arg Gly Pro Pro Gly Pro Gln Gly 
225 230 235 240 


Ala Arg Gly Leu Pro Gly Thr Ala Gly Leu Pro Gly Met Lys Gly His 
245 250 255 


Arg Gly Phe Ser Gly Leu Asp Gly Ala Lys Gly Glu Pro Gly Pro Ala 
260 265 270 


Gly Pro Lys Gly Glu Pro Gly Ser Pro Gly Glu Asn Gly Ala Pro Gly 
275 280 285 


Gln Met Gly Pro Arg Gly Leu Pro Gly Glu Arg Gly Arg Pro Gly Pro 
290 295 300 


Ser Gly Pro Ala Gly Ala Arg Gly Asn Asp Gly Ala Pro Gly Ala Ala 
305 310 315 320 


Gly Pro Pro Gly Pro Thr Gly Pro Ala Gly Pro Pro Gly Phe Pro Gly 
325 330 335 


Ala Ala Gly Ala Lys Gly Glu Thr Gly Pro Gln Gly Ala Arg Gly Ser 
340 345 350 


Glu Gly Pro Gln Gly Ala Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala 
355 360 365 


Gly Ala Ala Gly Pro Ala Gly Asn Pro Gly Ala Asp Gly Gln Pro Gly 
370 375 380 


Ala Lys Gly Ala Thr Gly Ala Pro Gly Ile Ala Gly Ala Pro Gly Phe 
385 390 395 400 


Pro Gly Ala Arg Pro Ser Pro Gly Pro Gln Gly Pro Ser Gly Ala Pro 
405 410 415 


Gly Pro Lys Gly Asn Ser Gly Glu Pro Gly Ala Pro Gly Asn Lys Gly 
420 425 430 


Asp Thr Gly Ala Lys Gly Glu Pro Gly Pro Ala Gly Val Gln Gly Pro 
435 440 445 


Pro Gly Pro Ala Gly Glu Glu Gly Lys Arg Gly Ala Arg Gly Glu Pro 
450 455 460 


Gly Pro Ala Gly Leu Pro Gly Pro Ala Gly Glu Arg Gly Ala Pro Gly 
465 470 475 480 


Ser Arg Gly Phe Pro Gly Ala Asp Gly Ile Ala Gly Pro Lys Gly Pro 
485 490 495 


Pro Gly Glu Arg Gly Ser Pro Gly Ala Val Gly Pro Lys Gly Ser Pro 
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-continued 
500 505 510 


Gly Glu Ala Gly Arg Pro Gly Glu Pro Gly Leu Pro Gly Ala Lys Gly 
515 520 525 


Leu Thr Gly Ser Pro Gly Ser Pro Gly Pro Asp Gly Lys Thr Gly Pro 
530 535 540 


Pro Gly Pro Ala Gly Gln Asp Gly Arg Pro Gly Pro Pro Gly Pro Pro 
545 550 555. 560 


Gly Ala Arg Gly Gln Ala Gly Val Met Gly Phe Pro Gly Pro Lys Gly 
565 570 575 


Ala Ala Gly Glu Pro Gly Lys Pro Gly Glu Arg Gly Ala Pro Gly Pro 
580 585 590 


Pro Gly Ala Val Gly Ala Ala Gly Lys Asp Gly Glu Ala Gly Ala Gln 
595 600 605 


Gly Pro Pro Gly Pro Thr Gly Pro Ala Gly Glu Arg Gly Glu Gln Gly 
610 615 620 


Pro Ala Gly Ala Pro Gly Phe Gln Gly Leu Pro Gly Pro Ala Gly Pro 
625 630 635 640 


Pro Gly Glu Ala Gly Lys Pro Gly Glu Gln Gly Val Pro Gly Asp Ala 
645 650 655 


Gly Ala Pro Gly Pro Ala Gly Ala Arg Gly Glu Arg Gly Phe Pro Gly 
660 665 670 


Glu Arg Gly Val Gln Gly Pro Pro Gly Pro Gln Gly Pro Arg Gly Ala 
675 680 685 


Asn Gly Ala Pro Gly Asn Asp Gly Ala Lys Gly Asp Ala Gly Ala Pro 
690 695 700 


Gly Ala Pro Gly Asn Gln Gly Pro Pro Gly Leu Gln Gly Met Pro Gly 
705 710 715 720 


Glu Arg Gly Ala Ala Gly Leu Pro Gly Ala Lys Gly Asp Arg Gly Asp 
725 730 735 


Pro Gly Pro Lys Gly Ala Asp Gly Ala Pro Gly Lys Asp Gly Leu Arg 
740 745 750 


Gly Leu Thr Gly Pro Ile Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly 
755 760 765 


Asp Lys Gly Glu Ala Gly Pro Pro Gly Pro Ala Gly Pro Thr Gly Ala 
770 775 780 


Arg Gly Ala Pro Gly Asp Arg Gly Glu Pro Gly Pro Pro Gly Pro Ala 
785 790 795 800 


Gly Phe Ala Gly Pro Pro Gly Ala Asp Gly Gln Pro Gly Ala Lys Gly 
805 810 815 


Glu Thr Gly Asp Ala Gly Ala Lys Gly Asp Ala Gly Pro Pro Gly Pro 
820 825 830 


Ala Gly Pro Thr Gly Ala Pro Gly Pro Ala Gly Ala Val Gly Ala Pro 
835 840 845 


Gly Pro Lys Gly Ala Arg Gly Ser Ala Gly Pro Pro Gly Ala Thr Gly 
850 855 860 


Phe Pro Gly Ala Ala Gly Arg Val Gly Pro Pro Gly Pro Ser Gly Asn 
865 870 875 880 


Ile Gly Leu Pro Gly Pro Pro Gly Pro Ser Gly Lys Glu Gly Gly Lys 
885 890 895 


Gly Pro Arg Gly Glu Thr Gly Pro Ala Gly Arg Pro Gly Glu Pro Gly 
900 905 910 
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Pro Ala Gly Pro Pro Gly Pro Pro Gly Glu Lys Gly Ser Pro Gly Ala 
915 920 925 


Asp Gly Pro Ile Gly Ala Pro Gly Thr Pro Gly Pro Gln Gly Ile Ala 
930 935 940 


Gly Gln Arg Gly Val Val Gly Leu Pro Gly Gln Arg Gly Glu Arg Gly 
945 950 955 960 


Phe Pro Gly Leu Pro Gly Pro Ser Gly Glu Pro Gly Lys Gln Gly Pro 
965 970 975 


Ser Gly Ser Pro Gly Glu Arg Gly Pro Pro Gly Pro Met Gly Pro Pro 
980 985 990 


Gly Leu Ala Gly Pro Pro Gly Glu Ala Gly Arg Glu Gly Ala Pro Gly 
995 1000 1005 


Ala Glu Gly Ala Pro Gly Arg Asp Gly Ala Ala Gly Pro Lys Gly 
1010 1015 1020 


Asp Arg Gly Glu Thr Gly Pro Ala Gly Pro Pro Gly Ala Pro Gly 
1025 1030 1035 


Ala Pro Gly Ala Pro Gly Pro Val Gly Pro Ala Gly Lys Asn Gly 
1040 1045 1050 


Asp Arg Gly Glu Thr Gly Pro Ala Gly Pro Ala Gly Pro Pro Gly 
1055 1060 1065 


Pro Ala Gly Ala Arg Gly Pro Ala Gly Pro Gln Gly Pro Arg Gly 
1070 1075 1080 


Asp Lys Gly Glu Thr Gly Glu Gln Gly Asp Arg Gly Met Lys Gly 
1085 1090 1095 


His Arg Gly Phe Ser Gly Leu Gln Gly Pro Pro Gly Pro Pro Gly 
1100 1105 1110 


Ala Pro Gly Glu Gln Gly Pro Ser Gly Ala Ser Gly Pro Ala Gly 
1115 1120 1125 


Pro Arg Gly Pro Pro Gly Ser Ala Gly Ala Ala Gly Lys Asp Gly 
1130 1135 1140 


Leu Asn Gly Leu Pro Gly Pro Ile Gly Pro Pro Gly Pro Arg Gly 
1145 1150 1155 


Arg Thr Gly Glu Val Gly Pro Val Gly Pro Pro Gly Pro Pro Gly 
1160 1165 1170 


Pro Pro Gly Pro Pro Gly Pro Pro Ser Gly Gly Phe Asp Phe Ser 
1175 1180 1185 


Phe Leu Pro Gln Pro Pro Gln Glu Lys Ala His Asp Gly Gly Arg 
190 1195 1200 


Tyr Tyr Arg Ala Asp Asp Ala Asn Val Met Arg Asp Arg Asp Leu 


205 1210 1215 


Glu Val Asp Thr Thr Leu Lys Ser Leu Ser Gln Gln Ile Glu Asn 
220 1225 1230 


Ile Arg Ser Pro Glu Gly Thr Arg Lys Asn Pro Ala Arg Thr Cys 
235 240 1245 


Arg Asp Leu Lys Met Cys His Gly Asp Trp Lys Ser Gly Glu Tyr 
250 255 1260 


Trp Ile Asp Pro Asn Gln Gly Cys Asn Leu Asp Ala Ile Lys Val 
265 270 1275 


Tyr Cys Asn Met Glu Thr Gly Glu Thr Cys Val Tyr Pro Thr Gin 
280 285 1290 
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Ala Thr Ile Ala Gln Lys Asn Trp Tyr Leu Ser Lys Asn Pro Lys 
295 300 1305 


Glu Lys Lys His Val Trp Phe Gly Glu Thr Met Ser Asp Gly Phe 
310 315 1320 


Gln Phe Glu Tyr Gly Gly Glu Gly Ser Asn Pro Ala Asp Val Ala 
325 330 1335 


Ile Gln Leu Thr Phe Leu Arg Leu Met Ser Thr Glu Ala Thr Gln 
340 345 1350 


Asn Val Thr Tyr His Cys Lys Asn Ser Val Ala Tyr Met Asp His 
355 360 1365 


Asp Thr Gly Asn Leu Lys Lys Ala Leu Leu Leu Gln Gly Ala Asn 
370 375 1380 


Glu Ile Glu Ile Arg Ala Glu Gly Asn Ser Arg Phe Thr Tyr Gly 
385 390 1395 


Val Thr Glu Asp Gly Cys Thr Ser His Thr Gly Ala Trp Gly Lys 
400 405 1410 


Thr Val Ile Glu Tyr Lys Thr Thr Lys Thr Ser Arg Leu Pro Ile 
1415 1420 1425 


Ile Asp Leu Ala Pro Met Asp Val Gly Ala Pro Asp Gln Glu Phe 
1430 1435 1440 


Gly Ile Asp Ile Gly Pro Val Cys Phe Leu 
1445 1450 


<210> SEQ ID NO 9 

<211> LENGTH: 15 

<212> TYPE: PRT 

<213> ORGANISM: Tyrannosaurus rex 


<400> SEQUENCE: 9 
Gly Ala Pro Gly Pro Gln Gly Pro Ser Gly Ala Pro Gly Pro Lys 
1 5 10 15 


<210> SEQ ID NO 10 

<211> LENGTH: 33 

<212> TYPE: PRT 

<213> ORGANISM: Brachylophosaurus canadensis 


<400> SEQUENCE: 10 


Gly Leu Thr Gly Pro Ile Gly Pro Pro Gly Pro Ala Gly Ala Pro Gly 
T 5 10 15 


Asp Lys Gly Glu Ala Gly Pro Ser Gly Pro Pro Gly Pro Thr Gly Ala 
20 25 30 


Arg 

«210» SEQ ID NO 11 

«211» LENGTH: 1363 

«212» TYPE: PRT 

«213» ORGANISM: Gallus gallus 


«400» SEQUENCE: 11 


Met Leu Ser Phe Val Asp Thr Arg Ile Leu Leu Leu Leu Ala Val Thr 
L 5 10 15 


Ser Tyr Leu Ala Thr Ser Gln His Val Ser Glu Ala Ser Ala Gly Arg 
20 25 30 


Lys Gly Pro Arg Gly Asp Lys Gly Pro Gln Gly Glu Arg Gly Pro Pro 
35 40 45 
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Gly Pro Pro Gly Arg Asp Gly Glu Asp Gly Pro Pro Gly Pro Pro Gly 
50 55 60 


Pro Pro Gly Pro Pro Gly Leu Gly Gly Asn Phe Ala Ala Gln Tyr Asp 
65 70 75 80 


Pro Ser Lys Ala Ala Asp Phe Gly Pro Gly Pro Met Gly Leu Met Gly 
85 90 95 


Pro Arg Gly Pro Pro Gly Ala Ser Gly Pro Pro Gly Pro Pro Gly Phe 
100 105 110 


Gln Gly Val Pro Gly Glu Pro Gly Glu Pro Gly Gln Thr Gly Pro Gln 
T115 120 125 


Gly Pro Arg Gly Pro Pro Gly Pro Pro Gly Lys Ala Gly Glu Asp Gly 
130 135 140 


His Pro Gly Lys Pro Gly Arg Pro Gly Glu Arg Gly Val Ala Gly Pro 
145 150 155 160 


Gin Gly Ala Arg Gly Phe Pro Gly Thr Pro Gly Leu Pro Gly Phe Lys 
165 170 175 


Gly Ile Arg Gly His Asn Gly Leu Asp Gly Gln Lys Gly Gln Pro Gly 
180 185 190 


Thr Pro Gly Thr Lys Gly Glu Pro Gly Ala Pro Gly Glu Asn Gly Thr 
195 200 205 


Pro Gly Gln Pro Gly Ala Arg Gly Leu Pro Gly Glu Arg Gly Arg Ile 
210 215 220 


Gly Ala Pro Gly Pro Ala Gly Ala Arg Gly Ser Asp Gly Ser Ala Gly 
225 230 235 240 


Pro Thr Gly Pro Ala Gly Pro Ile Gly Ala Ala Gly Pro Pro Gly Phe 
245 250 255 


Pro Gly Ala Pro Gly Ala Lys Gly Glu Ile Gly Pro Ala Gly Asn Val 
260 265 270 


Gly Pro Thr Gly Pro Ala Gly Pro Arg Gly Glu Ile Gly Leu Pro Gly 
275 280 285 


Ser Ser Gly Pro Val Gly Pro Pro Gly Asn Pro Gly Ala Asn Gly Leu 
290 295 300 


Pro Gly Ala Lys Gly Ala Ala Gly Leu Pro Gly Val Ala Gly Ala Pro 
305 310 315 320 


Gly Leu Pro Gly Pro Arg Gly Ile Pro Gly Pro Pro Gly Pro Ala Gly 
325 330 335 


Pro Ser Gly Ala Arg Gly Leu Val Gly Glu Pro Gly Pro Ala Gly Ala 
340 345 350 


Lys Gly Glu Ser Gly Asn Lys Gly Glu Pro Gly Ala Ala Gly Pro Pro 
355 360 365 


Gly Pro Pro Gly Pro Ser Gly Glu Glu Gly Lys Arg Gly Ser Asn Gly 
370 375 380 


Glu Pro Gly Ser Ala Gly Pro Pro Gly Pro Ala Gly Leu Arg Gly Val 
385 390 395 400 


Pro Gly Ser Arg Gly Leu Pro Gly Ala Asp Gly Arg Ala Gly Val Met 
405 410 415 


Gly Pro Ala Gly Asn Arg Gly Ala Ser Gly Pro Val Gly Ala Lys Gly 
420 425 430 


Pro Asn Gly Asp Ala Gly Arg Pro Gly Glu Pro Gly Leu Met Gly Pro 
435 440 445 
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Arg Gly Leu Pro Gly Gln Pro Gly Ser Pro Gly Pro Ala Gly Lvs Glu 
450 455 460 


Gly Pro Val Gly Phe Pro Gly Ala Asp Gly Arg Val Gly Pro Ile Gly 
465 470 475 480 


Pro Ala Gly Asn Arg Gly Glu Pro Gly Asn Ile Gly Phe Pro Gly Pro 
485 490 495 


Lys Gly Pro Thr Gly Glu Pro Gly Lys Pro Gly Glu Lvs Gly Asn Val 
500 505 510 


Gly Leu Ala Gly Pro Arg Gly Ala Pro Gly Pro Glu Gly Asn Asn Gly 
515 520 525 


Ala Gln Gly Pro Pro Gly Val Thr Gly Asn Gln Gly Ala Lys Gly Glu 
530 535 540 


Thr Gly Pro Ala Gly Pro Pro Gly Phe Gln Gly Leu Pro Gly Pro Ser 
545 550 555 560 


Gly Pro Ala Gly Glu Ala Gly Lys Pro Gly Glu Arg Gly Leu His Gly 
565 570 575 


Glu Phe Gly Val Pro Gly Pro Ala Gly Pro Arg Gly Glu Arg Gly Leu 
580 585 590 


Pro Gly Glu Ser Gly Ala Val Gly Pro Ala Gly Pro Ile Gly Ser Arg 
595 600 605 


Gly Pro Ser Gly Pro Pro Gly Pro Asp Gly Asn Lys Gly Glu Pro Gly 
610 615 620 


Asn Val Gly Pro Ala Gly Ala Pro Gly Pro Ala Gly Pro Gly Gly Ile 
625 630 635 640 


Pro Gly Glu Arg Gly Val Ala Gly Val Pro Gly Gly Lys Gly Glu Lys 
645 650 655 


Gly Ala Pro Gly Leu Arg Gly Asp Thr Gly Ala Thr Gly Arg Asp Gly 
660 665 670 


Ala Arg Gly Leu Pro Gly Ala Ile Gly Ala Pro Gly Pro Ala Gly Gly 
675 680 685 


Ala Gly Asp Arg Gly Glu Gly Gly Pro Ala Gly Pro Ala Gly Pro Ala 
690 695 700 


Gly Ala Arg Gly Ile Pro Gly Glu Arg Gly Glu Pro Gly Pro Val Gly 
705 710 715 720 


Pro Ser Gly Phe Ala Gly Pro Pro Gly Ala Ala Gly Gln Pro Gly Ala 
725 730 735 


Lys Gly Glu Arg Gly Pro Lys Gly Pro Lys Gly Glu Thr Gly Pro Thr 
740 745 750 


Gly Ala Ile Gly Pro Ile Gly Ala Ser Gly Pro Pro Gly Pro Val Gly 
755 760 765 


Ala Ala Gly Pro Ala Gly Pro Arg Gly Asp Ala Gly Pro Pro Gly Met 
770 775 780 


Thr Gly Phe Pro Gly Ala Ala Gly Arg Val Gly Pro Pro Gly Pro Ala 
785 790 795 800 


Gly Ile Thr Gly Pro Pro Gly Pro Pro Gly Pro Ala Gly Lys Asp Gly 
805 810 815 


Pro Arg Gly Leu Arg Gly Asp Val Gly Pro Val Gly Arg Thr Gly Glu 
820 825 830 


Gln Gly Ile Ala Gly Pro Pro Gly Phe Ala Gly Glu Lys Gly Pro Ser 
835 840 845 


Gly Glu Ala Gly Ala Ala Gly Pro Pro Gly Thr Pro Gly Pro Gln Gly 


US 2022/0333081 Al Oct. 20, 2022 
42 


-continued 
850 855 860 


Ile Leu Gly Ala Pro Gly Ile Leu Gly Leu Pro Gly Ser Arg Gly Glu 
865 870 875 880 


Arg Gly Leu Pro Gly Ile Ala Gly Ala Thr Gly Glu Pro Gly Pro Leu 
885 890 895 


Gly Val Ser Gly Pro Pro Gly Ala Arg Gly Pro Ser Gly Pro Val Gly 
900 905 910 


Ser Pro Gly Pro Asn Gly Ala Pro Gly Glu Ala Gly Arg Asp Gly Asn 
915 920 925 


Pro Gly Asn Asp Gly Pro Pro Gly Arg Asp Gly Ala Pro Gly Phe Lys 
930 935 940 


Gly Glu Arg Gly Ala Pro Gly Asn Pro Gly Pro Ser Gly Ala Leu Gly 
945 950 955 960 


Ala Pro Gly Pro His Gly Gln Val Gly Pro Ser Gly Lys Pro Gly Asn 
965 970 975 


Arg Gly Asp Pro Gly Pro Val Gly Pro Val Gly Pro Ala Gly Ala Phe 
980 985 990 


Gly Pro Arg Gly Leu Ala Gly Pro Gln Gly Pro Arg Gly Glu Lys Gly 
995 1000 1005 


Glu Pro Gly Asp Lys Gly His Arg Gly Leu Pro Gly Leu Lys Gly 
1010 1015 1020 


His Asn Gly Leu Gln Gly Leu Pro Gly Leu Ala Gly Gln His Gly 
1025 1030 1035 


Asp Gln Gly Pro Pro Gly Asn Asn Gly Pro Ala Gly Pro Arg Gly 
1040 1045 1050 


Pro Pro Gly Pro Ser Gly Pro Pro Gly Lys Asp Gly Arg Asn Gly 
1055 1060 1065 


Leu Pro Gly Pro Ile Gly Pro Ala Gly Val Arg Gly Ser His Gly 
1070 1075 1080 


Ser Gln Gly Pro Ala Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly 
1085 1090 1095 


Pro Pro Gly Pro Asn Gly Gly Gly Tyr Glu Val Gly Phe Asp Ala 
1100 1105 1110 


Glu Tyr Tyr Arg Ala Asp Gln Pro Ser Leu Arg Pro Lys Asp Tyr 
1115 1120 1125 


Glu Val Asp Ala Thr Leu Lys Thr Leu Asn Asn Gln Ile Glu Thr 
1130 1135 1140 


Leu Leu Thr Pro Glu Gly Ser Lys Lys Asn Pro Ala Arg Thr Cys 
45 1150 1155 


Arg Asp Leu Arg Leu Ser His Pro Glu Trp Ser Ser Gly Phe Tyr 
60 1165 1170 


Trp e Asp Pro Asn Gln Gly Cys Thr Ala Asp Ala Ile Arg Ala 
75 1180 1185 


Tyr Cvs Asp Phe Ala Thr Gly Glu Thr Cys Ile His Ala Ser Leu 
90 1195 1200 


Glu Asp Ile Pro Thr Lys Thr Trp Tyr Val Ser Lys Asn Pro Lys 
205 1210 1215 


Asp Lys Lys His Ile Trp Phe Gly Glu Thr Ile Asn Gly Gly Thr 
220 1225 1230 


Gln Phe Glu Tyr Asn Gly Glu Gly Val Thr Thr Lys Asp Met Ala 
235 1240 1245 
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Thr Gln Leu Ala Phe Met Arg Leu Leu Ala Asn His Ala Ser Gln 
250 255 1260 


Asn Ile Thr Tyr His Cys Lys Asn Ser Ile Ala Tyr Met Asp Glu 
265 270 1275 


Glu Thr Gly Asn Leu Lys Lys Ala Val Ile Leu Gln Gly Ser Asn 
280 285 1290 


Asp Val Glu Leu Arg Ala Glu Gly Asn Ser Arg Phe Thr Phe Ser 
295 300 1305 


Val Leu Val Asp Gly Cys Ser Lys Lys Asn Asn Lys Trp Gly Lys 
310 315 1320 


Thr Ile Ile Glu Tyr Arg Thr Asn Lys Pro Ser Arg Leu Pro Ile 
325 330 1335 


Leu Asp Ile Ala Pro Leu Asp Ile Gly Gly Ala Asp Gln Glu Phe 
340 345 1350 


Gly Leu His Ile Gly Pro Val Cys Phe Lys 
355 360 


<210> SEO ID NO 12 

<211> LENGTH: 18 

<212> TYPE: PRT 

<213> ORGANISM: Tyrannosaurus rex 


<400> SEOUENCE: 12 


Gly Leu Pro Gly Glu Ser Gly Ala Val Gly Pro Ala Gly Pro Pro Gly 
1 5 10 15 


Ser Arg 


<210> SEO ID NO 13 

<211> LENGTH: 18 

<212> TYPE: PRT 

<213> ORGANISM: Brachylophosaurus canadensis 


<400> SEOUENCE: 13 


Gly Ala Thr Gly Leu Pro Gly Val Ala Gly Ala Pro Gly Leu Pro Gly 
1 5 10 15 


Pro Arg 


<210> SEO ID NO 14 

<211> LENGTH: 1985 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Seguence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 
polypeptide 

<220> FEATURE: 

<221> NAME/KEY: MOD RES 

<222> LOCATION: (1306)..(1306) 
<223> OTHER INFORMATION: Sor T 
<220> FEATURE: 

<221> NAME/KEY: MOD RES 

<222> LOCATION: (1668)..(1668 

<223> OTHER INFORMATION: D or N 
<220> FEATURE: 

<221> NAME/KEY: MOD RES 

<222> LOCATION: (1915)..(1915 

<223> OTHER INFORMATION: V or A 


<400> SEOUENCE: 14 


Met Ser Ser Asp Ser Glu Met Ala Val Phe Gly Glu Ala Ala Pro Tyr 
1 5 10 L5 
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Leu 


Asp 


Lys 


Leu 


65 


Asn 


Leu 


Trp 


Tyr 


Gly 


145 


Asn 


Ile 


Ile 


Pro 


Ala 


225 


Asp 


Thr 


Ser 


Tyr 


Ile 


305 


Thr 


Ala 


Lys 


Lys 


Lys 
385 


Leu 


Arg 


Ser 


Gly 


50 


Asp 


Pro 


His 


Met 


Lys 


130 


Lys 


Ala 


Thr 


Gln 


Gln 


210 


Asn 


Asn 


Gly 


Arg 


Gln 


290 


Ser 


Val 


Ile 


Leu 


Gln 


370 


Ala 


Cvs 


Lvs 


Lvs 


35 


Met 


Asp 


Pro 


Glu 


Ile 


115 


Trp 


Lys 


Tyr 


Gly 


Tyr 


195 


Thr 


Pro 


Ser 


Lys 


Val 


275 


Ile 


Thr 


Ala 


Asp 


Thr 
355 
Arg 


Gly 


Cys 


Pro 


20 


Lys 


Ile 


Arg 


Lys 


Pro 


100 


Tyr 


Leu 


Arg 


Gln 


Glu 


180 


Phe 


Gly 


Leu 


Ser 


Leu 


260 


Thr 


Met 


Asn 


Ser 


Ile 


340 


Gly 


Glu 


Tyr 


Pro 


Glu 


Ala 


Gln 


Thr 


Phe 


85 


Ala 


Thr 


Pro 


Gln 


Phe 


165 


Ser 


Ala 


Lys 


Leu 


Arg 


245 


Ala 


Phe 


ser 


Pro 


Tle 


325 


Leu 


Ala 


Glu 


Leu 


Arg 
405 


Lys 


Cys 


Asn 


Ile 


70 


Asp 


Val 


Tyr 


val 


Glu 


150 


Met 


Gly 


Thr 


Met 


Glu 


230 


Phe 


ser 


Gln 


Asn 


Phe 


310 


Asp 


Gly 


Val 


Gin 


Met 
390 


Val 


Glu 


Phe 


Arg 


55 


Thr 


Lys 


Leu 


Ser 


Tyr 


135 


Ala 


Leu 


Ala 


Ile 


Gin 


215 


Ala 


Gly 


Ala 


Leu 


Lys 


295 


Asp 


Asp 


Phe 


Met 


Ala 
375 


Gly 


Lys 


Arg 


Ala 


40 


Glu 


Leu 


Ile 


Tyr 


Gly 


120 


Asn 


Pro 


Thr 


Gly 


Ala 


200 


Gly 


Phe 


Lys 


Asp 


Ser 


280 


Lys 


Phe 


Ser 


Ser 


His 


360 


Glu 


Leu 


Val 


Thr 


25 


Val 


Asn 


Asn 


Glu 


Asn 


105 


Leu 


Pro 


Pro 


Asp 


Lys 


185 


Val 


Thr 


Gly 


Phe 


Ile 


265 


Ser 


Pro 


Pro 


Glu 


Ser 


345 


Tyr 


Pro 


Asn 


Gly 


Glu 


Asp 


Asp 


Thr 


Asp 


90 


Leu 


Phe 


Glu 


His 


Arg 


170 


Thr 


Pro 


Leu 


Asn 


Ile 


250 


Glu 


Glu 


Glu 


Phe 


Glu 


330 


Glu 


Gly 


Asp 


Ser 


Asn 
410 


Ala 


Asp 


Lys 


Ala 


75 


Met 


Lys 


Cvs 


Val 


Ile 


155 


Asp 


Val 


Gly 


Glu 


Ala 


235 


Arg 


Thr 


Arg 


Leu 


Ile 


315 


Leu 


Glu 


Asn 


Gly 


Ala 


395 


Glu 


44 


-continued 


Gln Asn 


Lvs Glu 
45 


Val Thr 
60 


Gln Val 


Ala Met 


Glu Arg 


Val Thr 
125 


Val Met 
140 


Phe Ser 


Asn Gln 


Asn Thr 


Asp Lys 


205 


Asp Gln 
220 


Lys Thr 


Ile His 


Tyr Leu 


Ser Tyr 
285 


Ile Asp 
300 


Ser Gln 


Leu Ala 


Lys Val 
Met Lys 
365 


Thr Glu 
380 


Glu Met 


Tyr Val 


Arg 


30 


Val 


Val 


Phe 


Met 


Tyr 


110 


Val 


Ala 


Ile 


Ser 


Lys 


190 


Lys 


Ile 


Val 


Phe 


Leu 


270 


His 


Leu 


Gly 


Thr 


Gly 


350 


Phe 


Val 


Leu 


Thr 


Pro 


Tyr 


Lys 


Pro 


Thr 


95 


Ala 


Asn 


Tyr 


Ser 


Ile 


175 


Arg 


Lys 


Ile 


Arg 


Gly 


255 


Glu 


Ile 


Leu 


Glu 


Asp 


335 


Ile 


Lys 


Ala 


Lys 


Lys 
415 


Phe 


Val 


Thr 


Met 


80 


His 


Ala 


Pro 


Arg 


Asp 


160 


Leu 


Val 


Glu 


Gln 


Asn 


240 


Ala 


Lys 


Phe 


Leu 


Val 


320 


Asn 


Tyr 


Gin 


Asp 


Gly 


400 


Gly 


Oct. 20, 2022 
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Gln Asn Val Gln Gln Val Thr Asn Ser Val Gly Ala Leu Ala Lys Ala 
420 425 430 


Val Tyr Glu Lys Met Phe Leu Trp Met Val Ala Arg Ile Asn Gln Gln 
435 440 445 


Leu Asp Thr Lys Gln Ala Arg Gln Tyr Phe Ile Gly Val Leu Asp Ile 
450 455 460 


Ala Gly Phe Glu Ile Phe Asp Phe Asn Ser Leu Glu Gln Leu Cys Ile 
465 470 475 480 


Asn Phe Thr Asn Glu Lys Leu Gln Gln Phe Phe Asn His His Met Phe 
485 490 495 


Val Leu Glu Gln Glu Glu Tyr Lys Lys Glu Gly Ile Glu Trp Glu Phe 
500 505 510 


Ile Asp Phe Gly Met Asp Leu Ala Ala Cys Ile Glu Leu Ile Glu Lys 
515 520 525 


Pro Met Gly Ile Phe Ser Ile Leu Glu Glu Glu Cys Met Phe Pro Lys 
530 535 540 


Ala Thr Asp Thr Ser Phe Lys Asn Lys Leu Tyr Asp Gln His Leu Gly 
545 550 555 560 


Lys Ser Ala Asn Phe Gln Lys Pro Lys Pro Thr Lys Gly Lys Ala Glu 
565 570 575 


Ala His Phe Ser Leu Val His Tyr Ala Gly Thr Val Asp Tyr Asn Ile 
580 585 590 


Val Gly Trp Leu Asp Lys Asn Lys Asp Pro Leu Asn Glu Thr Val Val 
595 600 605 


Gly Leu Tyr Gln Lys Ser Ser Leu Lys Leu Leu Ala Phe Leu Phe Ser 
610 615 620 


Asn Tyr Ala Gly Ala Glu Ala Gly Asp Cys Gly Gly Ser Lys Lys Gly 
625 630 635 640 


Gly Lys Lys Lys Gly Ser Ser Phe Gln Thr Val Ser Ala Val Phe Arg 
645 650 655 


Glu Asn Leu Asn Lys Leu Met Thr Asn Leu Arg Ser Thr His Pro His 
660 665 670 


Phe Val Arg Cys Leu Ile Pro Asn Glu Thr Lys Thr Pro Gly Val Met 
675 680 685 


Asp His Tyr Leu Val Ile His Gln Leu Arg Cys Asn Gly Val Leu Glu 
690 695 700 


Gly Ile Arg Ile Cys Arg Lys Gly Phe Pro Ser Arg Ile Leu Tyr Gly 
705 710 715 720 


Asp Phe Arg Gln Arg Tvr Arg Ile Leu Asn Ala Ser Ala Ile Pro Glu 
725 730 735 


Glv Gln Phe Ile Asp Ser Lvs Asn Ala Ser Glu Lvs Leu Leu Asn Ser 
740 745 750 


Ile Asn Val Asp Arg Glu Gln Tvr Arg Phe Gly His Thr Lvs Val Phe 
755 760 765 


Phe Lys Ala Gly Leu Leu Gly Leu Leu Glu Glu Met Arg Asp Glu Lys 
770 775 780 


Leu Ala Thr Leu Met Thr Arg Thr Gln Ala Val Cys Arg Gly Tyr Leu 
785 790 195 800 


Met Arg Val Glu Phe Lys Lys Met Met Glu Arg Arg Asp Ser Ile Leu 
805 810 815 


Cys Ile Gln Asp Asn Ile Arg Ser Phe Met Asn Val Lys His Trp Pro 
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820 825 830 


Trp Met Asn Leu Phe Phe Lys Ile Gln Pro Leu Leu Lys Ser Ala Glu 
835 840 845 


Ala Glu Lys Glu Met Ala Thr Met Lys Glu Asn Phe Lys Arg Ala Gln 
850 855 860 


Glu Glu Leu Ala Arg Ser Glu Ala Arg Arg Lys Glu Leu Glu Glu Lys 
865 870 875 880 


Met Val Ser Leu Leu Gln Glu Lys Asn Asp Leu Gln Leu Gln Val Gln 
885 890 895 


Ser Glu Thr Glu Asn Leu Met Asp Ala Glu Glu Arg Cys Glu Gly Leu 
900 905 910 


Ile Lys Ser Lys Ile Gln Leu Glu Ala Lys Val Lys Glu Leu Asn Glu 
915 920 925 


Lys Leu Glu Glu Glu Glu Glu Ile Asn Ser Glu Leu Val Ala Lys Lys 
930 935 940 


Arg Asn Leu Glu Asp Lys Cys Phe Ser Leu Lys Lys Asp Ile Asp Asp 
945 950 955 960 


Leu Glu Leu Thr Leu Thr Lys Val Glu Lys Glu Lys His Ala Thr Glu 
965 970 975 


Asn Lys Val Lys Asn Leu Ser Glu Glu Met Thr Ala Leu Glu Glu Asn 
980 985 990 


Ile Ser Lys Leu Thr Lys Glu Lys Lys Ser Leu Gln Glu Ala His Gln 
995 1000 1005 


Gln Thr Leu Asp Gly Leu Gln Val Glu Glu Asp Lys Val Asn Gly 
1010 1015 1020 


Leu Ile Lys Ile Asn Ala Lys Leu Glu Gln Gln Thr Asp Asp Leu 
1025 1030 1035 


Glu Gly Cys Leu Glu Gln Glu Lys Lys Leu Arg Ala Asp Leu Gly 
1040 1045 1050 


Arg Leu Lys Arg Lys Leu Glu Gly Asp Leu Lys Ile Ser Gln Glu 
1055 1060 1065 


Ser Ile Thr Asp Leu Glu Asn Ala Gln Gln Gln Met Glu Glu Lys 
1070 1075 1080 


Leu Lys Lys Lys Glu Phe Glu Ile Ser Gln Leu Gln Thr Lys Ile 
1085 1090 1095 


Asp Gly Glu Gln Val His Ser Leu Gln Leu Gln Lys Lys Ile Lys 
1100 1105 1110 


Glu Leu Gln Ala Arg Thr Glu Glu Leu Glu Glu Asp Ile Glu Ala 
15 1120 1125 


Glu His Thr Leu Arg Ala Lys Leu Glu Arg Gln Arg Ser Asp Leu 
30 1135 1140 


Ala Arg Glu Leu Glu Glu Ile Ser Glu Arg Leu Glu Glu Ala Ser 
45 1150 1155 


Gly Ala Thr Ser Ala Gln Val Glu Leu Asn Lys Lys Arg Glu Ala 
60 1165 1170 


Glu Phe Gln Lys Met Arg Arg Asp Leu Glu Glu Ala Thr Leu Gln 
75 1180 1185 


His Glu Ala Thr Ala Ala Thr Leu Arg Lys Lys His Ala Asp Ser 
90 1195 1200 


Val Ala Glu Leu Gly Glu Gln Ile Asp Asn Leu Gln Arg Val Lys 
205 1210 1215 
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Gln Lys Leu Gln Lys Glu Lys Ser Glu Leu Lys Met Glu Met Asp 
220 225 1230 


Asp Met Ala Ser Asn Ile Glu Thr Ile Thr Lvs Ser Lys Ser Asn 
235 240 1245 


Met Glu Arg Met Cys Arg Thr Met Glu Asp Gln Phe Asn Glu Leu 
250 255 1260 


Lys Ala Lys Asp Asp Gln Gln Thr Gln Leu Ile His Asp Leu Asn 
265 270 1275 


Met Gln Lys Ala Arg Leu Gln Thr Gln Asn Gly Glu Leu Ser His 
280 285 1290 


Gln Val Glu Glu Lys Glu Ser Leu Ile Ser Gln Leu Xaa Lys Gly 
295 300 1305 


Lys Gln Ala Leu Thr Gln Gln Leu Glu Glu Leu Lys Arg Gln Leu 
310 315 1320 


Glu Glu Glu Thr Lys Ala Lys Asn Ala Leu Ala His Ala Leu Gln 
325 1330 1335 


Ser Ser Arg His Asp Cys Asp Leu Leu Arg Glu Gln Tyr Glu Glu 
1340 1345 1350 


Glu Gln Glu Gly Lys Ala Glu Val Gln Arg Ala Leu Ser Lys Ala 
1355 1360 1365 


Asn Ser Glu Val Ala Gln Trp Arg Thr Lys Tyr Glu Thr Asp Ala 
1370 1375 1380 


Ile Gln Arg Thr Glu Glu Leu Glu Glu Ala Lys Lys Lys Leu Ala 
1385 1390 1395 


Gln Arg Leu Gln Glu Ala Glu Glu Asn Thr Glu Ala Val Asn Ser 
1400 1405 1410 


Lys Cys Ala Ser Leu Glu Lys Thr Lys Gln Arg Leu Gln Gly Glu 
1415 1420 1425 


Val Asp Asp Leu Met Leu Asp Leu Glu Arg Ala Asn Thr Thr Cys 
1430 1435 1440 


Ala Thr Leu Gly Lys Lys Gln Arg Asn Phe Asp Lys Ile Leu Ala 
1445 1450 1455 


Asp Trp Lys Gln Lys Leu Asp Glu Ser His Ala Glu Leu Glu Ala 
1460 1465 1470 


Ala Gln Lys Glu Ser Arg Ser Leu Ser Thr Glu Ile Phe Lys Met 
1475 1480 1485 


Lys Asn Ala Tyr Glu Glu Val Val Asp Gln Leu Glu Thr Leu Arg 
490 1495 1500 


Arg Glu Asn Lys Asn Leu Gln Glu Glu Ile His Asp Val Thr Glu 
505 1510 1515 


Gln Ile Ala Glu Thr Gly Lys Asn Leu Gln Glu Met Glu Lys Thr 
520 1525 1530 


Lys Lys Gln Val Glu Gln Glu Lys Ser Asp Leu Gln Val Ala Leu 
535 540 1545 


Glu Glu Val Glu Gly Ser Leu Glu His Glu Glu Ser Lys Ile Leu 
550 555 1560 


Arg Val Gln Leu Glu Leu Ser Gln Leu Lys Ser Glu Leu Asp Arg 
565 570 1575 


Lys Val Ala Glu Lys Asp Glu Glu Met Glu Gln Leu Lys Arg Asn 
580 585 1590 
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Cys Gln Arg Ala Ala Glu Ala Thr Gln Ser Leu Leu Glu Ala Glu 
595 600 1605 


Ser Arg Ser Arg Lys Asp Ala Leu Arg Leu Lys Lys Lys Met Glu 
610 615 1620 


Gly Asp Leu Asn Glu Met Glu Val His Leu Gly His Ala Asn Arg 
625 630 1635 


Gln Val Ala Glu Thr Gln Lys His Leu Arg Thr Val Gln Gly Gln 
640 645 1650 


Leu Lys Gly Ser Gln Leu His Leu Gly Asp Ala Leu Arg Ser Xaa 
655 660 1665 


Glu Ala Leu Lys Glu Arg Leu Ala Ile Val Gly Arg Arg Asn Gly 
670 675 1680 


Leu Leu Val Glu Glu Leu Glu Glu Thr Lys Val Ala Leu Glu Gln 
685 690 1695 


Ala Glu Arg Ser Arg Arg Leu Ser Glu Gln Glu Leu Gln Asp Ala 
700 705 1710 


Ser Asp Arg Val Gln Leu Leu Leu Ser Arg Cys Asn Thr Ser Leu 
1715 1720 1725 


Ile Asn Ile Lys Lys Lys Leu Glu Thr Asn Ile Ala Gin Cys Gln 
1730 1735 1740 


Ala Glu Val Glu Asn Ser Val Gln Glu Ser Arg Asn Ala Glu Glu 
1745 1750 1755 


Lys Ala Lys Lys Ala Ile Thr Asp Ala Ala Met Met Ala Glu Glu 
1760 1765 1770 


Leu Lys Lys Glu Gln Asp Thr Ser Ala His Leu Glu Arg Met Lys 
1775 1780 1785 


Lys Asn Leu Glu Gln Thr Val Lys Asp Met Gln His Arg Leu Asp 
1790 1795 1800 


Glu Ala Glu Gln Leu Ala Leu Lys Gly Asp Lys Lys Gln Ile Gln 
1805 1810 1815 


Lys Leu Glu Asn Arg Val Arg Glu Leu Glu Ser Glu Leu Asp Thr 
1820 1825 1830 


Glu Gln Lys Arg Gly Ala Glu Ala Leu Lys Gly Ala His Lys Tyr 
1835 1840 1845 


Glu Arg Lys Val Lys Glu Met Ala Tyr Gln Ala Glu Glu Asp His 
1850 1855 1860 


Lys Asn Ser Leu Arg Leu Gln Asp Leu Val Asp Lys Leu Gln Ala 
1865 1870 1875 


Lys Val Asn Ala Tyr Lys Arg Gln Ala Glu Glu Ala Glu Glu Gln 
880 1885 1890 


Ala Asn Thr Gln Leu Ser Arg Cys Arg Arg Val Gln His Glu Leu 
895 1900 1905 


Glu Glu Ala Glu Glu Arg Xaa Asp Ile Ala Glu Ser Gln Val Asn 
910 1915 1920 


Lys Leu Arg Ala Lys Ser Arg Asp Val Gly Ser Gln Val Arg Ala 
925 1930 1935 


Asp Thr Trp Ala Trp Leu Leu Leu Lys Glu Ser Arg Asp Pro Gly 
940 1945 1950 


Ala Gln Arg Ser His Arg Glu Ser Leu Glu Ala Ala His Trp Ser 
955 1960 1965 


Leu Leu Leu Val Met Pro Ser Gln Met Pro Pro Thr Gin Gln Arg 
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1970 1975 


Leu Leu 
1985 


<210> SEQ ID NO 15 
<211> LENGTH: 1363 
<212> TYPE: PRT 


<213> ORGANISM: Artificial Sequence 


<220> FEATURE: 


<223> OTHER INFORMATION: Description of Artificial Sequence: 


polypeptide 
<220> FEATURE: 
<221> NAME/KEY: MOD_RES 
<222> LOCATION: (25)..(25) 
<223> OTHER INFORMATION: V or L 
<220> FEATURE: 
<221> NAME/KEY: MOD_RES 
<222> LOCATION: (26)..(26) 
<223> OTHER INFORMATION: S or F 


<400> SEQUENCE: 15 


Met Leu Ser Phe Val Asp Thr Arg 
1 5 


Ser Tyr Leu Ala Thr Ser Gln His 
20 


Lys Gly Pro Arg Gly Asp Lys Gly 
35 40 


Gly Pro Pro Gly Arg Asp Gly Glu 
50 55 


Pro Pro Gly Pro Pro Gly Leu Gly 
65 70 


Pro Ser Lys Ala Ala Asp Phe Gly 
85 


Pro Arg Gly Pro Pro Gly Ala Ser 
100 


Gln Gly Val Pro Gly Glu Pro Gly 
115 120 


Gly Pro Arg Gly Pro Pro Gly Pro 
130 155 


His Pro Gly Lys Pro Gly Arg Pro 
145 150 


Gln Gly Ala Arg Gly Phe Pro Gly 
165 


Gly Ile Arg Gly His Asn Gly Leu 
180 


Thr Pro Gly Thr Lys Gly Glu Pro 
195 200 


Pro Gly Gln Pro Gly Ala Arg Gly 
210 215 


Glv Ala Pro Gly Pro Ala Gly Ala 
225 230 


Pro Thr Gly Pro Ala Gly Pro Ile 
245 


Pro Gly Ala Pro Gly Ala Lys Gly 
260 


Gly Pro Thr Gly Pro Ala Gly Pro 


Ile 


Xaa 


25 


Pro 


Asp 


Gly 


Pro 


Gly 


105 


Glu 


Pro 


Gly 


Thr 


Asp 


185 


Gly 


Leu 


Arg 


Gly 


Glu 
265 


Arg 


Leu 


10 


Xaa 


Gln 


Gly 


Asn 


Gly 


90 


Pro 


Pro 


Gly 


Glu 


Pro 


170 


Gly 


Ala 


Pro 


Gly 


Ala 
250 


Ile 


Gly 


Leu 


Glx 


Gly 


Pro 


Phe 


75 


Pro 


Pro 


Gly 


Lys 


Arg 


155 


Gly 


Gln 


Pro 


Gly 


Ser 


235 


Ala 


Gly 


Glu 


49 


-continued 


1980 


Leu 


Ala 


Glu 


Pro 


60 


Ala 


Met 


Gly 


Gin 


Ala 


140 


Gly 


Leu 


Lys 


Gly 


Glu 


220 


Asp 


Gly 


Pro 


Ile 


Leu 


Ser 


Arg 


45 


Gly 


Ala 


Gly 


Pro 


Thr 


125 


Gly 


Val 


Pro 


Gly 


Glu 


205 


Arg 


Gly 


Pro 


Ala 


Gly 


Ala 


Ala 


30 


Gly 


Pro 


Gln 


Leu 


Pro 


110 


Gly 


Glu 


Ala 


Gly 


Glin 


190 


Asn 


Gly 


Ser 


Pro 


Gly 


270 


Leu 


Val 


15 


Gly 


Pro 


Pro 


Tvr 


Met 


95 


Gly 


Pro 


Asp 


Gly 


phe 


175 


Pro 


Gly 


Arg 


Ala 


Gly 
255 


Asn 


Pro 


Svnthetic 


Thr 


Arg 


Pro 


Gly 


Asp 


80 


Gly 


Phe 


Gln 


Gly 


Pro 


160 


Lvs 


Gly 


Thr 


Ile 


Gly 
240 
Phe 


Val 


Gly 
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-continued 
275 280 285 


Ser Ser Gly Pro Val Gly Pro Pro Gly Asn Pro Gly Ala Asn Gly Leu 
290 295 300 


Pro Gly Ala Lys Gly Ala Thr Gly Leu Pro Gly Val Ala Gly Ala Pro 
305 310 315 320 


Gly Leu Pro Gly Pro Arg Gly Ile Pro Gly Pro Pro Gly Pro Ala Gly 
325 330 335 


Pro Ser Gly Ala Arg Gly Leu Val Gly Glu Pro Gly Pro Ala Gly Ala 
340 345 350 


Lys Gly Glu Ser Gly Asn Lys Gly Glu Pro Gly Ala Ala Gly Pro Pro 
355 360 365 


Gly Pro Pro Gly Pro Ser Gly Glu Glu Gly Lys Arg Gly Ser Asn Gly 
370 375 380 


Glu Pro Gly Ser Ala Gly Pro Pro Gly Pro Ala Gly Leu Arg Gly Val 
385 390 395 400 


Pro Gly Ser Arg Gly Leu Pro Gly Ala Asp Gly Arg Ala Gly Val Met 
405 410 415 


Gly Pro Ala Gly Asn Arg Gly Ala Ser Gly Pro Val Gly Ala Lys Gly 
420 425 430 


Pro Asn Gly Asp Ala Gly Arg Pro Gly Glu Pro Gly Leu Met Gly Pro 
435 440 445 


Arg Gly Leu Pro Gly Gln Pro Gly Ser Pro Gly Pro Ala Gly Lys Glu 
450 455 460 


Gly Pro Val Gly Phe Pro Gly Ala Asp Gly Arg Val Gly Pro Ile Gly 
465 470 475 480 


Pro Ala Gly Asn Arg Gly Glu Pro Gly Asn Ile Gly Phe Pro Gly Pro 
485 490 495 


Lys Gly Pro Thr Gly Glu Pro Gly Lys Pro Gly Glu Lys Gly Asn Val 
500 505 510 


Gly Leu Ala Gly Pro Arg Gly Ala Pro Gly Pro Glu Gly Asn Asn Gly 
515 520 525 


Ala Gln Gly Pro Pro Gly Val Thr Gly Asn Gln Gly Ala Lys Gly Glu 
530 535 540 


Thr Gly Pro Ala Gly Pro Pro Gly Phe Gln Gly Leu Pro Gly Pro Ser 
545 550 555 560 


Gly Pro Ala Gly Glu Ala Gly Lys Pro Gly Glu Arg Gly Leu His Gly 
565 570 575 


Glu Phe Gly Val Pro Gly Pro Ala Gly Pro Arg Gly Glu Arg Gly Leu 
580 585 590 


Pro Gly Glu Ser Gly Ala Val Gly Pro Ala Gly Pro Pro Gly Ser Arg 
595 600 605 


Gly Pro Ser Gly Pro Pro Gly Pro Asp Gly Asn Lys Gly Glu Pro Gly 
610 615 620 


Asn Val Gly Pro Ala Gly Ala Pro Gly Pro Ala Gly Pro Gly Gly Ile 
625 630 635 640 


Pro Gly Glu Arg Gly Val Ala Gly Val Pro Gly Gly Lys Gly Glu Lys 
645 650 655 


Gly Ala Pro Gly Leu Arg Gly Asp Thr Gly Ala Thr Gly Arg Asp Gly 
660 665 670 


Ala Arg Gly Leu Pro Gly Ala Ile Gly Ala Pro Gly Pro Ala Gly Gly 
675 680 685 
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Ala Gly Asp Arg Gly Glu Gly Gly Pro Ala Gly Pro Ala Gly Pro Ala 
690 695 700 


Gly Ala Arg Gly Ile Pro Gly Glu Arg Gly Glu Pro Gly Pro Val Gly 
705 710 715 720 


Pro Ser Gly Phe Ala Gly Pro Pro Gly Ala Ala Gly Gln Pro Gly Ala 
725 730 735 


Lys Gly Glu Arg Gly Pro Lys Gly Pro Lys Gly Glu Thr Gly Pro Thr 
740 745 750 


Gly Ala Ile Gly Pro Ile Gly Ala Ser Gly Pro Pro Gly Pro Val Gly 
755 760 765 


Ala Ala Gly Pro Ala Gly Pro Arg Gly Asp Ala Gly Pro Pro Gly Met 
770 775 780 


Thr Gly Phe Pro Gly Ala Ala Gly Arg Val Gly Pro Pro Gly Pro Ala 
785 790 795 800 


Gly Ile Thr Gly Pro Pro Gly Pro Pro Gly Pro Ala Gly Lys Asp Gly 
805 810 815 


Pro Arg Gly Leu Arg Gly Asp Val Gly Pro Val Gly Arg Thr Gly Glu 
820 825 830 


Gln Gly Ile Ala Gly Pro Pro Gly Phe Ala Gly Glu Lys Gly Pro Ser 
835 840 845 


Gly Glu Ala Gly Ala Ala Gly Pro Pro Gly Thr Pro Gly Pro Gln Gly 
850 855 860 


Ile Leu Gly Ala Pro Gly Ile Leu Gly Leu Pro Gly Ser Arg Gly Glu 
865 870 875 880 


Arg Gly Leu Pro Gly Ile Ala Gly Ala Thr Gly Glu Pro Gly Pro Leu 
885 890 895 


Gly Val Ser Gly Pro Pro Gly Ala Arg Gly Pro Ser Gly Pro Val Gly 
900 905 910 


Ser Pro Gly Pro Asn Gly Ala Pro Gly Glu Ala Gly Arg Asp Gly Asn 
915 920 925 


Pro Gly Asn Asp Gly Pro Pro Gly Arg Asp Gly Ala Pro Gly Phe Lys 
930 935 940 


Gly Glu Arg Gly Ala Pro Gly Asn Pro Gly Pro Ser Gly Ala Leu Gly 
945 950 955 960 


Ala Pro Gly Pro His Gly Gln Val Gly Pro Ser Gly Lvs Pro Gly Asn 
965 970 978 


Arg Gly Asp Pro Gly Pro Val Gly Pro Val Gly Pro Ala Gly Ala Phe 
980 985 990 


Gly Pro Arg Gly Leu Ala Gly Pro Gln Gly Pro Arg Gly Glu Lys Gly 
995 1000 1005 


Glu Pro Gly Asp Lys Gly His Arg Gly Leu Pro Gly Leu Lys Gly 
010 1015 1020 


His Asn Gly Leu Gln Gly Leu Pro Gly Leu Ala Gly Gln His Gly 
025 030 1035 


Asp Gln Gly Pro Pro Gly Asn Asn Gly Pro Ala Gly Pro Arg Gly 
040 045 1050 


Pro Pro Gly Pro Ser Gly Pro Pro Gly Lys Asp Gly Arg Asn Gly 
055 060 1065 


Leu Pro Giv Pro Ile Gly Pro Ala Gly Val Arg Gly Ser His Gly 
070 075 1080 
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Ser Gln Gly Pro Ala Gly Pro Pro Gly Pro Pro Gly Pro Pro Gly 
085 090 1095 


Pro Pro Gly Pro Asn Gly Gly Gly Tyr Glu Val Gly Phe Asp Ala 
00 05 1110 


Glu Tyr Tvr Arg Ala Asp Gln Pro Ser Leu Arg Pro Lys Asp Tyr 
15 20 1125 


Glu Val Asp Ala Thr Leu Lys Thr Leu Asn Asn Gln Ile Glu Thr 
30 35 1140 


Leu Leu Thr Pro Glu Gly Ser Lys Lys Asn Pro Ala Arg Thr Cys 
45 50 1155 


Arg Asp Leu Arg Leu Ser His Pro Glu Trp Ser Ser Gly Phe Tyr 
60 65 1170 


Trp Ile Asp Pro Asn Gln Gly Cys Thr Ala Asp Ala Ile Arg Ala 
75 80 1185 


Tyr Cys Asp Phe Ala Thr Gly Glu Thr Cys Ile His Ala Ser Leu 
90 95 1200 


Glu Asp Ile Pro Thr Lys Thr Trp Tyr Val Ser Lys Asn Pro Lys 
1205 210 1215 


Asp Lys Lys His Ile Trp Phe Gly Glu Thr Ile Asn Gly Gly Thr 
1220 1225 1230 


Gln Phe Glu Tyr Asn Gly Glu Gly Val Thr Thr Lys Asp Met Ala 
1235 1240 1245 


Thr Gln Leu Ala Phe Met Arg Leu Leu Ala Asn His Ala Ser Gln 
1250 1255 1260 


Asn Ile Thr Tyr His Cys Lys Asn Ser Ile Ala Tyr Met Asp Glu 
1265 1270 1275 


Glu Thr Gly Asn Leu Lys Lys Ala Val Ile Leu Gln Gly Ser Asn 
1280 1285 1290 


Asp Val Glu Leu Arg Ala Glu Gly Asn Ser Arg Phe Thr Phe Ser 
1295 1300 1305 


Val Leu Val Asp Gly Cys Ser Lys Lys Asn Asn Lys Trp Gly Lys 
1310 1315 1320 


Thr Ile Ile Glu Tyr Arg Thr Asn Lys Pro Ser Arg Leu Pro Ile 
1325 1330 1335 


Leu Asp Ile Ala Pro Leu Asp Ile Gly Gly Ala Asp Gln Glu Phe 
1340 1345 1350 


Gly Leu His Ile Gly Pro Val Cys Phe Lys 


<210> SEQ ID NO 16 

<211> LENGTH: 1985 

<212> TYPE: PRT 

<213> ORGANISM: Loxodonta africana 


<400> SEQUENCE: 16 


Met Ser Ser Asp Ser Glu Met Ala Val Phe Gly Glu Ala Ala Pro Tyr 
$ 5 10 15 


Leu Arg Lys Pro Glu Lys Glu Arg Thr Glu Ala Gln Asn Arg Pro Phe 
20 25 30 


Asp Ser Lys Lys Ala Cys Phe Ala Val Asp Asp Lys Glu Val Tyr Val 
35 40 45 


Lys Gly Met Ile Gln Asn Arg Glu Asn Asp Lys Val Thr Val Lys Thr 
50 55 60 
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Leu 


65 


Asn 


Leu 


Trp 


Tyr 


Gly 


145 


Asn 


Ile 


Ile 


Pro 


Ala 


225 


Asp 


Thr 


Ser 


Tyr 


Ile 


305 


Thr 


Ala 


Leu 


Gin 


Val 


Leu 


Asp 


Pro 


His 


Met 


Lvs 


130 


Lvs 


Ala 


Thr 


Gln 


Gln 


210 


Asn 


Asn 


Gly 


Arg 


Gln 


290 


Ser 


Val 


Ile 


Leu 


Gln 


370 


Ala 


Cys 


Asn 


Tyr 


Asp 
450 


Asp 


Pro 


Glu 


Ile 


115 


Trp 


Lvs 


Tvr 


Gly 


Tyr 


195 


Thr 


Pro 


Ser 


Lys 


Val 


275 


Ile 


Thr 


Ala 


Asp 


Thr 


355 


Arg 


Gly 


Cvs 


Val 


Glu 
435 


Thr 


Arg 


Lvs 


Pro 


100 


Tyr 


Leu 


Arg 


Gln 


Glu 


180 


Phe 


Gly 


Leu 


Ser 


Leu 


260 


Thr 


Met 


Asn 


Ser 


Ile 


340 


Gly 


Glu 


Tyr 


Pro 


Gin 
420 


Lys 


Lys 


Thr 


Phe 


85 


Ala 


Thr 


Pro 


Gln 


Phe 


165 


Ser 


Ala 


Lys 


Leu 


Arg 


245 


Ala 


Phe 


Ser 


Pro 


Ile 


325 


Leu 


Ala 


Glu 


Leu 


Arg 


405 


Gln 


Met 


Gin 


Ile 


70 


Asp 


Val 


Tvr 


Val 


Glu 


150 


Met 


Gly 


Thr 


Met 


Glu 


230 


Phe 


Ser 


Gln 


Asn 


Phe 


310 


Asp 


Gly 


Val 


Gln 


Met 


390 


Val 


Val 


Phe 


Ala 


Thr 


Lys 


Leu 


Ser 


Tyr 


135 


Ala 


Leu 


Ala 


Ile 


Gln 


215 


Ala 


Gly 


Ala 


Leu 


Lys 


295 


Asp 


Asp 


Phe 


Met 


Ala 


375 


Gly 


Lys 


Thr 


Leu 


Arg 
455 


Leu 


Ile 


Tyr 


Gly 


120 


Asn 


Pro 


Thr 


Gly 


Ala 


200 


Gly 


Phe 


Lys 


Asp 


Ser 


280 


Lys 


Phe 


Ser 


Ser 


His 


360 


Glu 


Leu 


Val 


Asn 


Trp 


440 


Gln 


Asn 


Glu 


Asn 


105 


Leu 


Pro 


Pro 


Asp 


Lys 


185 


Val 


Thr 


Gly 


Phe 


Ile 


265 


Ser 


Pro 


Pro 


Glu 


Ser 


345 


Tyr 


Pro 


Asn 


Gly 


Ser 


425 


Met 


Tyr 


Thr 


Asp 


90 


Leu 


Phe 


Glu 


His 


Arg 


170 


Thr 


Pro 


Leu 


Asn 


Ile 


250 


Glu 


Glu 


Glu 


Phe 


Glu 


330 


Glu 


Gly 


Asp 


Ser 


Asn 
410 
Val 


Val 


Phe 


Ala 


75 


Met 


Lys 


Cys 


Val 


Ile 


155 


Asp 


Val 


Gly 


Glu 


Ala 


235 


Arg 


Thr 


Arg 


Leu 


Ile 


315 


Leu 


Glu 


Asn 


Gly 


Ala 


395 


Glu 


Gly 


Ala 


Ile 


53 


-continued 


Gln 


Ala 


Glu 


Val 


Val 


140 


Phe 


Asn 


Asn 


Asp 


Asp 


220 


Lys 


Tle 


Tyr 


Ser 


Tle 


300 


Ser 


Leu 


Lys 


Met 


Thr 


380 


Glu 


Tyr 


Ala 


Arg 


Gly 
460 


Val 


Met 


Arg 


Thr 


125 


Met 


Ser 


Gln 


Thr 


Lys 


205 


Gin 


Thr 


His 


Leu 


Tyr 


285 


Asp 


Gln 


Ala 


Val 


Lys 


365 


Glu 


Met 


Val 


Leu 


Ile 


445 


Val 


Phe 


Met 


Tyr 


110 


Val 


Ala 


Ile 


Ser 


Lys 


190 


Lys 


Ile 


val 


Phe 


Leu 


270 


His 


Leu 


Gly 


Thr 


Gly 


350 


Phe 


Val 


Leu 


Thr 


Ala 


430 


Asn 


Leu 


Pro 


Thr 


95 


Ala 


Asn 


Tyr 


Ser 


Ile 


175 


Arg 


Lys 


Ile 


Arg 


Gly 


255 


Glu 


Ile 


Leu 


Glu 


Asp 


335 


Ile 


Lys 


Ala 


Lys 


Lys 


415 


Lys 


Gin 


Asp 


Met 


80 


His 


Ala 


Pro 


Arg 


Asp 


160 


Leu 


Val 


Glu 


Gln 


Asn 


240 


Ala 


Lys 


Phe 


Leu 


Val 


320 


Asn 


Tyr 


Gin 


Asp 


Gly 


400 


Gly 


Ala 


Gin 


Ile 
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Ala Gly Phe Glu Ile Phe Asp Phe Asn Ser Leu Glu Gln Leu Cys Ile 
465 470 475 480 


Asn Phe Thr Asn Glu Lys Leu Gln Gln Phe Phe Asn His His Met Phe 
485 490 495 


Val Leu Glu Gln Glu Glu Tyr Lys Lys Glu Gly Ile Glu Trp Glu Phe 
500 505 510 


Ile Asp Phe Gly Met Asp Leu Ala Ala Cys Ile Glu Leu Ile Glu Lys 
515 520 525 


Pro Met Gly Ile Phe Ser Ile Leu Glu Glu Glu Cvs Met Phe Pro Lvs 
530 535 540 


Ala Thr Asp Thr Ser Phe Lys Asn Lys Leu Tyr Asp Gln His Leu Gly 
545 550 555 560 


Lys Ser Ala Asn Phe Gln Lys Pro Lys Pro Thr Lys Gly Lys Ala Glu 
565 570 575 


Ala His Phe Ser Leu Val His Tyr Ala Gly Thr Val Asp Tyr Asn Ile 
580 585 590 


Val Gly Trp Leu Asp Lys Asn Lys Asp Pro Leu Asn Glu Thr Val Val 
595 600 605 


Gly Leu Tyr Gln Lys Ser Ser Leu Lys Leu Leu Ala Phe Leu Phe Ser 
610 615 620 


Asn Tyr Ala Gly Ala Glu Ala Gly Asp Cys Gly Gly Ser Lys Lys Gly 
625 630 635 640 


Gly Lys Lys Lys Gly Ser Ser Phe Gln Thr Val Ser Ala Val Phe Arg 
645 650 655 


Glu Asn Leu Asn Lys Leu Met Thr Asn Leu Arg Ser Thr His Pro His 
660 665 670 


Phe Val Arg Cys Leu Ile Pro Asn Glu Thr Lys Thr Pro Gly Val Met 
675 680 685 


Asp His Tyr Leu Val Ile His Gln Leu Arg Cys Asn Gly Val Leu Glu 
690 695 700 


Gly Ile Arg Ile Cys Arg Lys Gly Phe Pro Ser Arg Ile Leu Tyr Gly 
705 710 715 720 


Asp Phe Arg Gln Arg Tyr Arg Ile Leu Asn Ala Ser Ala Ile Pro Glu 
725 730 735 


Gly Gln Phe Ile Asp Ser Lys Asn Ala Ser Glu Lys Leu Leu Asn Ser 
740 745 750 


Ile Asn Val Asp Arg Glu Gln Tyr Arg Phe Gly His Thr Lys Val Phe 
755 760 765 


Phe Lys Ala Gly Leu Leu Gly Leu Leu Glu Glu Met Arg Asp Glu Lys 
770 775 780 


Leu Ala Thr Leu Met Thr Arg Thr Gln Ala Val Cys Arg Gly Tyr Leu 
785 790 795 800 


Met Arg Val Glu Phe Lys Lys Met Met Glu Arg Arg Asp Ser Ile Leu 
805 810 815 


Cvs Ile Gln Asp Asn Ile Arg Ser Phe Met Asn Val Lys His Trp Pro 
820 825 830 


Trp Met Asn Leu Phe Phe Lvs Ile Gln Pro Leu Leu Lvs Ser Ala Glu 
835 840 845 


Ala Glu Lys Glu Met Ala Thr Met Lys Glu Asn Phe Lys Arg Ala Gln 
850 855 860 


Glu Glu Leu Ala Arg Ser Glu Ala Arg Arg Lys Glu Leu Glu Glu Lys 
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865 870 875 880 


Met Val Ser Leu Leu Gln Glu Lys Asn Asp Leu Gln Leu Gln Val Gln 
885 890 895 


Ser Glu Thr Glu Asn Leu Met Asp Ala Glu Glu Arg Cys Glu Gly Leu 
900 905 910 


Ile Lys Ser Lys Ile Gln Leu Glu Ala Lys Val Lys Glu Leu Asn Glu 
915 920 925 


Lys Leu Glu Glu Glu Glu Glu Ile Asn Ser Glu Leu Val Ala Lys Lys 
930 935 940 


Arg Asn Leu Glu Asp Lys Cys Phe Ser Leu Lys Lys Asp Ile Asp Asp 
945 950 955 960 


Leu Glu Leu Thr Leu Thr Lys Val Glu Lys Glu Lys His Ala Thr Glu 
965 970 975 


Asn Lys Val Lys Asn Leu Ser Glu Glu Met Thr Ala Leu Glu Glu Asn 
980 985 990 


Ile Ser Lys Leu Thr Lys Glu Lys Lys Ser Leu Gln Glu Ala His Gln 
995 1000 1005 


Gln Thr Leu Asp Gly Leu Gln Val Glu Glu Asp Lys Val Asn Gly 
1010 1015 1020 


Leu Ile Lys Ile Asn Ala Lys Leu Glu Gln Gln Thr Asp Asp Leu 
1025 1030 1035 


Glu Gly Cys Leu Glu Gln Glu Lys Lys Leu Arg Ala Asp Leu Gly 
1040 1045 1050 


Arg Leu Lys Arg Lys Leu Glu Gly Asp Leu Lys Ile Ser Gln Glu 
1055 1060 1065 


Ser Ile Thr Asp Leu Glu Asn Ala Gln Gln Gln Met Glu Glu Lys 
1070 1075 1080 


Leu Lys Lys Lys Glu Phe Glu Ile Ser Gln Leu Gln Thr Lys Ile 
1085 1090 1095 


Asp Gly Glu Gln Val His Ser Leu Gln Leu Gln Lvs Lys Ile Lys 
1100 1105 1110 


Glu Leu Gln Ala Arg Thr Glu Glu Leu Glu Glu Asp Ile Glu Ala 
1115 1120 1125 


Glu His Thr Leu Arg Ala Lys Leu Glu Arg Gln Arg Ser Asp Leu 
1130 1135 1140 


Ala Arg Glu Leu Glu Glu Ile Ser Glu Arg Leu Glu Glu Ala Ser 
1145 1150 1155 


Gly Ala Thr Ser Ala Gln Val Glu Leu Asn Lys Lys Arg Glu Ala 
160 1165 1170 


Glu Phe Gln Lys Met Arg Arg Asp Leu Glu Glu Ala Thr Leu Gln 
115 1180 1185 


His Glu Ala Thr Ala Ala Thr Leu Arg Lys Lys His Ala Asp Ser 
190 1195 1200 


Val Ala Glu Leu Gly Glu Gln Ile Asp Asn Leu Gln Arg Val Lys 
205 1210 1215 


Gln Lys Leu Gln Lys Glu Lys Ser Glu Leu Lys Met Glu Met Asp 
220 1225 1230 


Asp Met Ala Ser Asn Ile Glu Thr Ile Thr Lys Ser Lys Ser Asn 
235 1240 1245 


Met Glu Arg Met Cys Arg Thr Met Glu Asp Gln Phe Asn Glu Leu 
250 1255 1260 
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Lys Ala Lys Asp Asp Gln Gln Thr Gln Leu Ile His Asp Leu Asn 
265 270 1275 


Met Gln Lys Ala Arg Leu Gln Thr Gln Asn Gly Glu Leu Ser His 
280 285 1290 


Gln Val Glu Glu Lys Glu Ser Leu Ile Ser Gln Leu Thr Lys Gly 
295 300 1305 


Lvs Gln Ala Leu Thr Gln Gln Leu Glu Glu Leu Lvs Arg Gln Leu 
310 315 1320 


Glu Glu Glu Thr Lys Ala Lys Asn Ala Leu Ala His Ala Leu Gln 
325 330 1335 


Ser Ser Arg His Asp Cys Asp Leu Leu Arg Glu Gln Tyr Glu Glu 
340 345 1350 


Glu Gln Glu Gly Lys Ala Glu Val Gln Arg Ala Leu Ser Lys Ala 
355 360 1365 


Asn Ser Glu Val Ala Gln Trp Arg Thr Lys Tyr Glu Thr Asp Ala 
370 1375 1380 


Ile Gln Arg Thr Glu Glu Leu Glu Glu Ala Lys Lys Lys Leu Ala 
1385 1390 1395 


Gln Arg Leu Gln Glu Ala Glu Glu Asn Thr Glu Ala Val Asn Ser 
1400 1405 1410 


Lys Cys Ala Ser Leu Glu Lys Thr Lys Gln Arg Leu Gln Gly Glu 
1415 1420 1425 


Val Asp Asp Leu Met Leu Asp Leu Glu Arg Ala Asn Thr Thr Cys 
1430 1435 1440 


Ala Thr Leu Gly Lys Lys Gln Arg Asn Phe Asp Lys Ile Leu Ala 
1445 1450 1455 


Asp Trp Lys Gln Lys Leu Asp Glu Ser His Ala Glu Leu Glu Ala 
1460 1465 1470 


Ala Gln Lys Glu Ser Arg Ser Leu Ser Thr Glu Ile Phe Lys Met 
1475 1480 1485 


Lys Asn Ala Tyr Glu Glu Val Val Asp Gln Leu Glu Thr Leu Arg 
1490 1495 1500 


Arg Glu Asn Lys Asn Leu Gln Glu Glu Ile His Asp Val Thr Glu 
1505 1510 1515 


Gln Ile Ala Glu Thr Gly Lvs Asn Leu Gln Glu Met Glu Lys Thr 
1520 1525 1530 


Lys Lys Gln Val Glu Gln Glu Lys Ser Asp Leu Gln Val Ala Leu 
535 1540 1545 


Glu Glu Val Glu Gly Ser Leu Glu His Glu Glu Ser Lys Ile Leu 
550 1555 1560 


Arg Val Gln Leu Glu Leu Ser Gln Leu Lys Ser Glu Leu Asp Arg 
565 1570 1575 


Lys Val Ala Glu Lys Asp Glu Glu Met Glu Gln Leu Lys Arg Asn 
580 585 1590 


Cys Gln Arg Ala Ala Glu Ala Thr Gln Ser Leu Leu Glu Ala Glu 
595 600 1605 


Ser Arg Ser Arg Lys Asp Ala Leu Arg Leu Lys Lys Lys Met Glu 
610 615 1620 


Gly Asp Leu Asn Glu Met Glu Val His Leu Gly His Ala Asn Arg 
625 630 1635 
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Gln Val Ala Glu Thr Gln Lys His Leu Arg Thr Val Gln Gly Gln 
640 645 1650 


Leu Lys Gly Ser Gln Leu His Leu Gly Asp Ala Leu Arg Ser Asn 
655 660 1665 


Glu Ala Leu Lvs Glu Arg Leu Ala Ile Val Gly Arg Arg Asn Gly 
670 675 1680 


Leu Leu Val Glu Glu Leu Glu Glu Thr Lys Val Ala Leu Glu Gln 
685 690 1695 


Ala Glu Arg Ser Arg Arg Leu Ser Glu Gln Glu Leu Gln Asp Ala 
700 705 1710 


Ser Asp Arg Val Gln Leu Leu Leu Ser Arg Cys Asn Thr Ser Leu 
FIS 720 1725 


Ile Asn Ile Lys Lys Lys Leu Glu Thr Asn Ile Ala Gin Cys Gln 
730 735 1740 


Ala Glu Val Glu Asn Ser Val Gln Glu Ser Arg Asn Ala Glu Glu 
1745 1750 1755 


Lys Ala Lys Lys Ala Ile Thr Asp Ala Ala Met Met Ala Glu Glu 
1760 1765 1770 


Leu Lys Lys Glu Gln Asp Thr Ser Ala His Leu Glu Arg Met Lys 
1775 1780 1785 


Lys Asn Leu Glu Gln Thr Val Lys Asp Met Gln His Arg Leu Asp 
1790 1795 1800 


Glu Ala Glu Gln Leu Ala Leu Lys Gly Asp Lys Lys Gln Ile Gln 
1805 1810 1815 


Lys Leu Glu Asn Arg Val Arg Glu Leu Glu Ser Glu Leu Asp Thr 
1820 1825 1830 


Glu Gln Lys Arg Gly Ala Glu Ala Leu Lys Gly Ala His Lys Tyr 
1835 1840 1845 


Glu Arg Lys Val Lys Glu Met Ala Tyr Gln Ala Glu Glu Asp His 
1850 1855 1860 


Lys Asn Ser Leu Arg Leu Gln Asp Leu Val Asp Lys Leu Gln Ala 
1865 1870 1875 


Lys Val Asn Ala Tyr Lys Arg Gln Ala Glu Glu Ala Glu Glu Gln 
1880 1885 1890 


Ala Asn Thr Gln Leu Ser Arg Cys Arg Arg Val Gln His Glu Leu 
895 1900 1905 


Glu Glu Ala Glu Glu Arg Ala Asp Ile Ala Glu Ser Gln Val Asn 
910 1915 1920 


Lys Leu Arg Ala Lys Ser Arg Asp Val Gly Ser Gln Val Arg Ala 
925 1930 1935 


Asp Thr Trp Ala Trp Leu Leu Leu Lys Glu Ser Arg Asp Pro Gly 
940 1945 1950 


Ala Gln Arg Ser His Arg Glu Ser Leu Glu Ala Ala His Trp Ser 
955 1960 1965 


Leu Leu Leu Val Met Pro Ser Gln Met Pro Pro Thr Gin Gln Arg 
970 1975 1980 


Leu Leu 
985 
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What is claimed is: 

1. Amethod of synthesizing extinct muscle derived tissue, 
comprising: 

selecting an extinct genome comprising at least a portion 

of a genome of an extinct species; 

selecting a foundational genome comprising at least a 

portion of a genome of a non-extinct ancestor to the 
extinct species; 

identifving at least one absent gene, the at least one absent 

gene comprising genes present in the extinct genome 
and absent from the foundational genome; 

combining the foundational genome and the at least one 

absent gene to generate a hybridized genome; 
expressing the at least one absent gene of the hybridized 
genome in a growing cell mass; and 

preparing the grown cell mass for consumption as a 

cell-based meat product. 

2. The method of claim 1, wherein the extinct species is 
a mammoth and the non-extinct ancestor is an elephant. 

3. The method of claim 1, wherein the extinct species is 
a dinosaur and the non-extinct ancestor is a chicken. 

4. The method of claim 1. wherein the at least one absent 
gene comprises only genes that support muscle tissue 
growth. 

5. The method of claim 3, wherein the at least one absent 
gene comprises collagen genes. 

6. The method of claim 2, wherein the at least one absent 
gene comprises myosin genes. 

7. The method of claim 1, wherein the at least one absent 
gene comprises actin, creatine kinase, tropomyosin, 
fibronectin, mvoglobin, or keratin. 

8. The method of claim 1, wherein the grown cell mass 
comprises cells of the non-extinct ancestor transfected with 
the hvbridized genome. 

9. The method of claim 1, wherein the grown cell mass 
comprises cells derived from chicken transfected with the 
hybridized genome. 

10. The method of claim 9, wherein the hybridized 
genome is codon optimized for expression by cultured 
chicken cells. 

11. The method of claim 1, further comprising expressing 
the foundational genes of the hybridized genome in the 
growing cell mass. 

12. The method of claim 1, wherein the at least one absent 
gene is overexpressed using a promoter. 

13. The method of claim 1, wherein the extinct genome is 
comprised of at least a portion of two or more extinct 
species. 

14. A method of synthesizing extinct muscle derived 
tissue, comprising: 

selecting an extinct genome comprising at least a portion 

of a genome of an extinct species; 

selecting a foundational genome comprising at least a 

portion of a genome of a non-extinct ancestor to the 
extinct species; 
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identifying at least one gene present in the extinct genome 
that exhibits distinct expression from a sister gene in 
the foundational genome; 

substituting the sister gene of the foundational genome 

with the at least one extinct genome-specific gene to 
generate a hybridized genome; 

expressing the at least one extinct genome-specific gene 

of the hybridized genome in a growing cell mass; and 
preparing the grown cell mass for consumption. 

15. The method of claim 14, wherein the extinct species 
is a mammoth and the non-extinct ancestor is an elephant. 

16. The method of claim 14, wherein the extinct species 
is a dinosaur and the non-extinct ancestor is a chicken. 

17. The method of claim 14, wherein the sister gene 
comprises only genes that support muscle tissue growth. 

18. The method of claim 16, wherein the sister gene 
comprises collagen genes. 

19. The method of claim 15, wherein the sister gene 
comprises myosin genes. 

20. The method of claim 14, wherein the sister gene 
comprises actin, creatine kinase, tropomyosin, fibronectin, 
myoglobin, or keratin. 

21. The method of claim 14, wherein the grown cell mass 
comprises cells of the non-extinct ancestor transfected with 
the hybridized genome. 

22. The method of claim 14, wherein the grown cell mass 
comprises cells derived from chicken transfected with the 
hybridized genome. 

23. The method of claim 22, wherein the hybridized 
genome is codon optimized for expression by cultured 
chicken cells. 

24. The method of claim 14, further comprising express- 
ing the foundational genes of the hybridized genome in the 
growing cell mass. 

25. The method of claim 14, wherein the at least one 
absent gene is overexpressed using a promoter. 

26. The method of claim 14, wherein the extinct genome 
is comprised of at least a portion of two or more extinct 
species. 

27. A cell-based food product for consumption, compris- 
ing extinct muscle derived tissue, the extinct muscle derived 
tissue comprising at least one gene from an extinct species 
and at least one gene from a non-extinct ancestor to the 
extinct species; wherein the at least one gene from the 
extinct species is absent from the genome of the non-extinct 
ancestor. 

28. A cell-based food product for consumption, compris- 
ing an extinct muscle derived tissue, the extinct muscle 
derived tissue comprising at least one gene from an extinct 
species and at least one gene from a non-extinct ancestor to 
the extinct species; wherein the at least one gene from the 
extinct species is substituted for a sister gene from the 
non-extinct ancestor. 


